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GenCore version 6.2.1 
Copyright (c) 1993 - 2008 Biocceleration Ltd. 



OM protein - protein search, using sw model 

Run on: June 24, 2008, 15:17:39 ; Search time 529 Seconds 

(without alignments) 
2416.325 Million cell updates/sec 

Title: US-10-552-515-l_COPY_157_933 
Perfect score: 4123 

Sequence: 1 QQDVQDGNTTVHYALLSASW SELSSHWTPFTVPKASQLQQ 77 7 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 5032670 seqs, 1645091341 residues 

Total number of hits satisfying chosen parameters: 5032670 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : UniProt_12 . 1 : * 

1: uniprot_sprot : * 
2: uniprot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 

1 4123 100.0 933 1 TM 1 6 G_HUMAN Q6iwh7 homo sapien 

2 3478.5 84.4 859 1 TM16G_MOUSE Q14at5 mus musculu 
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3 


3472 


84. 


.2 


860 


1 


TM16G_RAT 


Q6ift6 


rattus norv 


4 


1505.5 


36 . 


.5 


920 


2 


A6QLE6_BOVIN 


A6qle6 


bos taurus 


5 


1498 .5 


36. 


.3 


955 


1 


TM 1 6 D_HUMAN 


Q32m45 


homo sapien 


6 


1476 .5 


35. 


.8 


903 


2 


Q1AP3 6_STRPU 


Qlap36 


strongyloce 


7 


1475.5 


35. 


.8 


921 


2 


Q1AP3 5_STRPU 


Qlap35 


strongyloce 


8 


1472 


35. 


. 7 


981 


2 


A2AHE5_MOUSE 


A2ahe5 


mus musculu 


9 


1467.5 


35. 


.6 


981 


1 


TM 1 6 C_HUMAN 


Q9byt9 


homo sapien 


10 


1463 


35, 


.5 


900 


2 


A1A5Z4_DANRE 


Ala5z4 


danio rerio 


11 


1460.5 


35. 


.4 


921 


2 


Q1AP3 7_STRPU 


Qlap37 


strongyloce 


12 


1452 .5 


35. 


.2 


960 


1 


TM16A_MOUSE 


Q8bhy3 


mus musculu 


13 


1450 .5 


35. 


.2 


986 


1 


TM1 6 A_HUMAN 


Q5xxa6 


homo sapien 


14 


1437 


34. 


.9 


999 


1 


TM1 6B_HUMAN 


Q9nq90 


homo sapien 


15 


1434.5 


34. 


.8 


913 


2 


Q8CFW1_M0USE 


Q8cfwl 


mus musculu 


16 


1419 


34. 


.4 


896 


2 


Q6DDQ3_XENLA 


Q6ddq3 


xenopus lae 


17 


1413 .5 


34. 


.3 


1014 


2 


Q175J3_AEDAE 


Q175j3 


aedes aegyp 


18 


1413 .5 


34. 


.3 


1117 


2 


Q 0 I EX 5_AEDAE 


Q0iex5 


aedes aegyp 


19 


1412 .5 


34. 


.3 


913 


1 


TM16E_HUMAN 


Q75v66 


homo sapien 


20 


1402 .5 


34. 


.0 


906 


2 


Q7QDY0_ANOGA 


Q7qdy0 


anopheles g 


21 


1382.5 


33 . 


.5 


412 


2 


Q4SC9 7_TETNG 


Q4sc97 


tetraodon n 


22 


1382.5 


33. 


.5 


1235 


2 


Q2M0Y5_DROPS 


Q2m0y5 


drosophila 


23 


1378.5 


33. 


.4 


1219 


2 


Q9VTS0_DROME 


Q9vts0 


drosophila 


24 


1369 


33. 


.2 


910 


1 


TM16F_HUMAN 


Q4kmq2 


homo sapien 


25 


1367.5 


33 . 


.2 


712 


2 


Q8NCT 7_HUMAN 


Q8nct7 


homo sapien 


26 


1366.5 


33. 


.1 


1613 


2 


Q 4 S 1 F 6_TETNG 


Q4slf6 


tetraodon n 


27 


1355.5 


32, 


.9 


909 


2 


A6NNM6_HUMAN 


A6nnm6 


homo sapien 


28 


1351 


32 . 


.8 


904 


1 


TM16E_MOUSE 


Q75ur0 


mus musculu 


29 


1351 


32. 


.8 


904 


2 


Q3V65 7_MOUSE 


Q3v657 


mus musculu 


30 


1346 


32. 


.6 


871 


2 


Q8JFT1_DANRE 


Q8 jftl 


danio rerio 


31 


1344 


32. 


.6 


926 


2 


Q8IN71_DROME 


Q8in71 


drosophila 


32 


1344 


32. 


.6 


1075 


2 


Q9VDV4_DROME 


Q9vdv4 


drosophila 


33 


1343 


32. 


.6 


972 


2 


Q86P2 4_DROME 


Q86p24 


drosophila 


34 


1336.5 


32. 


.4 


911 


1 


TM16F_MOUSE 


Q6p9 j9 


mus musculu 


35 


1329 


32 . 


.2 


1062 


2 


Q2 93M2_DROPS 


Q293m2 


drosophila 


36 


1291 


31. 


.3 


895 


2 


Q 1 6 L 0 2_AEDAE 


Q16102 


aedes aegyp 


37 


1290 


31. 


.3 


1035 


2 


Q4SSV5_TETNG 


Q4ssv5 


tetraodon n 


38 


1281 


31. 


.1 


863 


2 


Q16MH2_AEDAE 


Q16mh2 


aedes aegyp 


39 


1251.5 


30. 


.4 


823 


2 


Q7QEP9_ANOGA 


Q7qep9 


anopheles g 


40 


1227.5 


29, 


.8 


971 


2 


Q4REV7_TETNG 


Q4rev7 


tetraodon n 


41 


1221 


29, 


.6 


1017 


2 


Q2 9H9 7_DROPS 


Q29h97 


drosophila 


42 


1207 


29, 


.3 


984 


2 


Q 8 MT 6 2_DROME 


Q8mt62 


drosophila 


43 


1205 


29, 


.2 


854 


2 


Q2VPA8_MOUSE 


Q2vpa8 


mus musculu 


44 


1159 . 5 


28 , 


. 1 


1043 


2 


Q9VYS8_DROME 


Q9vys8 


drosophila 


45 


1159.5 


28. 


.1 


1059 


2 


Q76NS2_DROME 


Q76ns2 


drosophila 



ALIGNMENTS 



RESULT 1 
TM1 6 G_HUMAN 

ID TM1 6 G_HUMAN Reviewed; 933 AA. 

AC Q6IWH7; Q6IWH6; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot . 
DT 05-JUL-2004, sequence version 1. 
DT 24-JUL-2007, entry version 21. 

DE Transmembrane protein 16G (New gene expressed in prostate) (Prostate 
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DE cancer-associated protein 5) (IPCA-5) (Dresden-transmembrane protein 

DE of the prostate) (D-TMPP) . 

GN Name=TMEM16G; Synonyms=NGEP, PCANAP5; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1 AND 2), TISSUE SPECIFICITY, 

RP SUBCELLULAR LOCATION, AND TOPOLOGY. 

RX PubMed=14981236; DOI=10 . 1073/pnas . 0308746101; 

RA Bera T.K., Das S., Maeda H., Beers R., Wolfgang CD., Kumar V., 

RA Hahn Y., Lee B., Pastan I.; 

RT "NGEP, a gene encoding a membrane protein detected only in prostate 

RT cancer and normal prostate."; 

RL Proc. Natl. Acad. Sci. U.S.A. 101:3 059-3064(2 004). 

RN [2] 

RP IDENTIFICATION. 

RX PubMed=10613842; DOI=10 . 1101/gr . 9 .12 . 1198; 

RA Walker M.G., Volkmuth W., Sprinzak E., Hodgson D., Klingler T.; 

RT "Prediction of gene function by genome-scale expression analysis: 

RT prostate cancer-associated genes."; 

RL Genome Res. 9:119 8-1203(199 9). 

RN [3] 

RP IDENTIFICATION. 

RX PubMed=15375614; 

RA Katoh M. , Katoh M. ; 

RT "Characterization of human TMEM16G gene in silico."; 

RL Int. J. Mol. Med. 14:759-764(2004). 

RN [4] 

RP ALTERNATIVE SPLICING (ISOFORM 3), TISSUE SPECIFICITY, AND INDUCTION BY 

RP ANDROGEN . 

RX PubMed=15761874; DOI=10 . 1002/pros . 20250; 

RA Kiessling A., Weigle B., Fuessel S., Ebner R. , Meye A., Rieger M.A., 

RA Schmitz M., Temme A., Bachmann M. , Wirth M.P., Rieber E.P.; 

RT "D-TMPP: a novel androgen-regulated gene preferentially expressed in 

RT prostate and prostate cancer that is the first characterized member of 

RT an eukaryotic gene family."; 

RL Prostate 64:387-400(2005). 

RN [5] 

RP FUNCTION, SUBCELLULAR LOCATION, AND TISSUE SPECIFICITY. 

RX PubMed=17308099; DOI=l 0 . 1 158 /0 008-5472 . CAN-06-26 73 ; 

RA Das S., Hahn Y., Nagata S., Willingham M.C., Bera T.K., Lee B., 

RA Pastan I . ; 

RT "NGEP, a prostate-specific plasma membrane protein that promotes the 

RT association of LNCaP cells."; 

RL Cancer Res. 67:1594-1601(2007). 

CC -!- FUNCTION: May play a role in cell-cell interactions. 

CC -!- SUBCELLULAR LOCATION: Isoform 1: Cell membrane; Multi-pass 

CC membrane protein. Isoform 2: Cytoplasm, cytosol. Note=Isoform 1 

CC concentrates at sites of cell-cell contact. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isof orms=3 ; 

CC Name=l; Synonyms=NGEP-L ; 
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CC 


IsoId= 


Q6IWH7 


-1; Sequence=Displayed; 


cc 


Name=2; 


Synonyms=NGEP-S ; 




cc 


IsoId= 


Q6IWH7 


-2; Sequence=VSP_026004, VSP_026005, VSP_026006; 


cc 


Name=3 ; 


Synonyms=D-TMPP ; 




cc 


IsoId= 


Q6IWH7 


-3; Sequence=VSP_026004, VSP_026007, VSP_026008; 


cc 


-!- TISSUE SPECIFICITY: Specifically expressed in epithelial cells of 


cc 


the prostate (at protein 


level) . 


cc 


-!- INDUCTION: Up- 


regulated 


by androgen. 


cc 


-!- SIMILARITY: Belongs to the TMEM16 family. 


cc 
cc 
cc 


-!- CAUTION 


It is 


uncertain 


whether Met-1 or Met-55 is the initiator. 


Copyrighted 


by the 


UniProt Consortium, see http://www.uniprot.org/terms 


cc 
cc 

DR 


Distributed 


under 


the Creative Commons Attr ibution-NoDerivs License 


EMBL; AY617079; AAT40139.1; 


-; mRNA. 


DR 


EMBL; AY617C 


80; AAT40140.1; 


- ; mRNA . 


DR 


UniGene; Hs 


163909 






DR 


Ensembl; ENSG00000146205; Homo sapiens. 


DR 


HGNC; HGNC:31677; 


TMEM16G. 




DR 


MIM; 605096, 


gene . 






DR 


PharmGKB; PA32 980; 






DR 


ArrayExpress ; Q6IWH7; -. 




DR 


GO; GO:00056 


86; C: 


plasma membrane; IDA : MGI . 


DR 


InterPro; IPR007632; DUF590. 




DR 


PANTHER; PTHR12308 


; DUF590; 


1. 


DR 


Pfam; PF04547; DUF590; 1. 




PE 


1 : Evidence 


at protein level 




KW 


Alternative 


splicing; Cytoplasm; Membrane; Polymorphism; 


KW 


Transmembrane . 






FT 


CHAIN 


1 


933 


Transmembrane protein 16G. 


FT 








/FTId=PRO_0000289326. 


FT 


TOPO_DOM 


1 


355 


Extracellular (Potential) . 


FT 


TRANSMEM 


356 


376 


Potential . 


FT 


TOPO_DOM 


377 


420 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


421 


441 


Potential . 


FT 


TOPO_DOM 


442 


499 


Extracellular (Potential). 


FT 


TRANSMEM 


500 


520 


Potential . 


FT 


TOPO_DOM 


521 


550 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


551 


571 


Potential . 


FT 


TOPO_DOM 


572 


588 


Extracellular (Potential). 


FT 


TRANSMEM 


589 


609 


Potential . 


FT 


TOPO_DOM 


610 


714 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


715 


735 


Potential . 


FT 


TOPO_DOM 


736 


763 


Extracellular (Potential). 


FT 


TRANSMEM 


764 


784 


Potential . 


FT 


TOPO_DOM 


785 


843 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


844 


864 


Potential . 


FT 


TOPO_DOM 


865 


933 


Extracellular (Potential). 


FT 


VAR_SEQ 


110 


110 


Missing (in isoform 2 and isoform 3). 


FT 








/FTId=VSP_026004 . 


FT 


VAR_SEQ 


158 


180 


QDVQDGNTTVHYALLSASWAVLC -> VRGGCHGQGPRPC I 


FT 








HSVTHDLAA (in isoform 2) . 


FT 








/FTId=VSP_026005 . 


FT 


VAR_SEQ 


181 


933 


Missing (in isoform 2). 


FT 








/FTId=VSP_026006 . 


FT 


VAR_SEQ 


859 


859 


H -> VAEAPAGSPIHGMRPRPCALPNSSTW (in 
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T isof orm 3 ) . 

T /FTId=VSP_026007 . 

T VAR_SEQ 860 933 Missing (in isoform 3) . 

T /FTId=VSP_026008 . 

T VARIANT 67 67 V -> I (in dbSNP : rs2302054 ) . 

T /FTId=VAR_032616 . 

Q SEQUENCE 933 AA; 105531 MW; D6FD42578A41D7D3 CRC64; 

Query Match 100.0%; Score 4123; DB 1; Length 933; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 777; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

y 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 15 7 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 216 

y 61 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
■b 217 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 2 76 

y 121 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 277 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 336 

y 181 RYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 33 7 RYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 3 96 

y 241 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 3 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 39 7 CLDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 456 

y 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAVWMCLV 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 457 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLV 516 

y 361 SI ILYRAIMAI VVSRSGNTLLAAWASRIASLTGS WNLVFILILSKI YVSLAHVLTRWEM 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 517 SI ILYRAIMAI VVSRSGNTLLAAWASRIASLTGSWNLVFILILSKI YVSLAHVLTRWEM 576 

y 421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 577 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 636 

y 481 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 63 7 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 6 96 

y 541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 69 7 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 756 

y 601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 75 7 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 816 
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Qy 



661 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIP 72 0 




Db 



817 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIP 8 76 



Qy 



721 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 77 7 




Db 



877 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKASQLQQ 93 3 



RESULT 2 

TM16G_MOUSE 

ID TM16G_MOUSE 



Reviewed; 



859 AA. 



AC Q14AT5; Q6IFT5; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot . 

DT 29-MAY-2007, sequence version 2. 

DT 24-JUL-2007, entry version 9. 

DE Transmembrane protein 16G (New gene expressed in prostate homolog) . 

GN Name=Tmeml6g; Synonyms=Ngep ; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Glires; Rodentia; Sciurognathi ; 

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=C57BL/6J; 

RG The mouse genome sequencing consortium; 

RL Submitted (JAN-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). 

RX PubMed=15489334; DOI=l 0 . 1 10 1 /gr . 2 59650 4 ; 

RG The MGC Project Team; 

RT "The status, quality, and expansion of the NIH full-length cDNA 

RT project: the Mammalian Gene Collection (MGC)."; 

RL Genome Res. 14:2121-2127(2004). 

RN [3] 

RP IDENTIFICATION. 

RX PubMed=14981236; DOI=10 . 1073/pnas . 0308746101; 

RA Bera T.K., Das S., Maeda H., Beers R., Wolfgang CD., Kumar V., 

RA Hahn Y., Lee B., Pastan I.; 

RT "NGEP, a gene encoding a membrane protein detected only in prostate 

RT cancer and normal prostate."; 

RL Proc. Natl. Acad. Sci. U.S.A. 101:3 059-3064(2 004). 

CC -!- FUNCTION: May play a role in cell-cell interactions (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Cell membrane; Multi-pass membrane protein 
CC (By similarity) . Note=Concentrates at sites of cell-cell contact 

CC (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isof orms=2 ; 

CC Name=l; 

CC IsoId=Q14AT5-l; Sequence=Displayed; 

CC Name=2; 

CC IsoId=Q14AT5-2; Sequence=VSP_026009, VSP_026010; 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 
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cc 














cc 


Copyrighted 


by the 


UniProt 


Consortium, see http : //www . uniprot 


org/terms 


cc 
cc 

DR 


Distributed 


under 


the Creative Commons Attribution-NoDerivs License 


EMBL; AC108412; -; 


NOT_ANNOTATED_CDS; Genomic_DNA. 




DR 


EMBL; AC124669; -; 


NOT_ANNOTATED_CDS; Genomic_DNA. 




DR 


EMBL; BC116706; AAI16707.1; 


-; mRNA. 




DR 


EMBL; BK004075; DAA04566.1; 


-; mRNA. 




DR 


UniGene ; 


Mm 


297411 








DR 


Ensembl ; 


ENSMUSG00000034107 


; Mus musculus. 




DR 


MGI; MGI: 


3052714; 


Tmeml6g. 






DR 


InterPro ; 


IPR007632; DUF590 






DR 


PANTHER; 


PTHR12308 


; DUF590; 


1. 




DR 


Pfam; PF04547; DUF590; 1. 






PE 


2 : Evidence 


at transcript level; 




KW 


Alternative 


splicing; Membrane; Transmembrane. 




FT 


CHAIN 




1 


859 


Transmembrane protein 16G. 




FT 










/FTId=PRO_0 0002 8 932 7. 




FT 


TOPO_DOM 




1 


297 


Extracellular (Potential). 




FT 


TRANSMEM 




298 


318 


Potential . 




FT 


TOPO_DOM 




319 


362 


Cytoplasmic (Potential). 




FT 


TRANSMEM 




363 


383 


Potential . 




FT 


TOPO_DOM 




384 


441 


Extracellular (Potential). 




FT 


TRANSMEM 




442 


462 


Potential . 




FT 


TOPO_DOM 




463 


492 


Cytoplasmic (Potential). 




FT 


TRANSMEM 




493 


513 


Potential . 




FT 


TOPO_DOM 




514 


530 


Extracellular (Potential) . 




FT 


TRANSMEM 




531 


551 


Potential . 




FT 


TOPO_DOM 




552 


651 


Cytoplasmic (Potential) . 




FT 


TRANSMEM 




652 


672 


Potential . 




FT 


TOPO_DOM 




673 


700 


Extracellular (Potential) . 




FT 


TRANSMEM 




701 


721 


Potential . 




FT 


TOPO_DOM 




722 


780 


Cytoplasmic (Potential). 




FT 


TRANSMEM 




781 


801 


Potential . 




FT 


TOPO_DOM 




802 


859 


Extracellular (Potential). 




FT 


VAR_SEQ 




836 


843 


ALL GAT GV -> VTVGVTGG (in isoform 


2) . 


FT 










/FTId=VSP_026009 . 




FT 


VAR_SEQ 




844 


859 


Missing (in isoform 2). 




FT 










/FTId=VSP_026010 . 




SQ 


SEQUENCE 




359 AA; 


97128 MW; 82E1A473C59C8DA3 CRC64; 





Query Match 84.4%; Score 3478.5; DB 1; 

Best Local Similarity 85.8%; Pred. No. 5.1e-282; 
Matches 653; Conservative 33; Mismatches 70; 



Length 859; 
Indels 5; Gaps 



1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I I I I I III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I : I 

9 9 QHDVQDEAAAVHYILLRAPWAVLCYYAEDLRLKLPLQELPNQASNWSATLLEWLGIPNIL 158 

61 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 120 

II III I I I I I I I : I : : I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

15 9 LEHVPDTPPEYYSCQFKASKLQWFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKGLF 218 



121 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 18 
II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 219 GIDQLLAE GVFSAAFPLHDGPFSAVPESSQVLGLIQRQVLFQHWARWGKWNKYQPLDHVR 278 



181 RYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 40 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I I I 111:1111 
2 79 RYFGEKVALYFAWLGFYTGWLLPAAVVGTWFLVGCFLVFSDIPTQELCHSSDSFDMCPL 338 

2 41 CLDCPFWLLSS ACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKS ATLAYRWDCSD 3 0 0 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I 
33 9 CSDCSFWLLSSACTLAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKNATLAYRWDCSD 3 98 

301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIVVMVAWVMCLV 360 

III I I I I I I I I I I : I I I I I I I I I I I I I I I I I I :: I I I I I I I I I I : : : I I I I I : I I I I 
39 9 YEDIEERPRPQFAATAPMTALNPITGEDEPYFPEKNRVRRMLAGSVVLLMMVAVVIMCLV 458 

361 SI ILYRAIMAI VVSRSGNTLLAAWASRIASLTGSVVNLVFILILSKI YVSLAHVLTRWEM 42 0 

I : I I I I I : I I I : I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I II I I I I I I I 
45 9 SVILYRAVMAI IVSRSDNAFLSAWASRIASLTGSVVNLVFILILSKVYVLLAQVLTRWEM 518 

421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 480 

I I I I I : I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I 
519 HRTQTEFEDAFTLKVFIFQFVNFYASPVYIAFFKGRFVGYPGNYHTLFGIRNEECPAGGC 5 78 

481 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 5 40 

I I I I I I I I I I I I I I I : I I I : I I I I : I I I I I I I I I I : : I I I I I I I I I 

5 7 9 L S E L AQE L L VI MVGKQ 1 1 NNVQE VL VPKLKGCWQKF SRGKKAG — TGTHPAPWEADY 6 33 

541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 6 00 

11:111111 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
63 4 ELLPCEGLFHEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 6 93 

601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I II I I I I I I I I I I I 
69 4 ERAQDIGIWFHILTGLTHLAVISNAFLLAFSSDFLPRVYYSWTHAPDLHGFLNFTLARAP 753 

661 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVI VFEHVVFSVGRLLDLLVPDIP 72 0 

: I : I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I : I I : I I I I I I I I I 
75 4 PTFTSAHNRTCRYRAFRDDDGHYSPTYWTLLAIRLAFVI VFEHVVFS IGRVLDLLVPDIP 813 

721 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELS 761 

I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I II I 
814 ESVEIKVKREYYLAKQALAENEALLGATGVKDDQPPSSEPS 854 



RESULT 3 
TM16G_RAT 

ID TM16G_RAT Reviewed; 860 AA. 

AC Q6IFT6; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot . 

DT 05-JUL-2004, sequence version 1. 

DT 24-JUL-2007, entry version 22. 

DE Transmembrane protein 16G (New gene expressed in prostate homolog) . 

GN Name=Tmeml6g; Synonyms=Ngep; 

OS Rattus norvegicus (Rat). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi ; 

OC Muroidea; Muridae; Murinae; Rattus. 
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OX NCBI_TaxID=10116; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=Brown Norway; 

RX PubMed=15057822; DOI=10 . 1038/nature02426; 

RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J. 

RA Scherer S., Scott G. , Steffen D., Worley K.C., Burch P.E., Okwuonu G 

RA Hines S., Lewis L., Deramo C, Delgado 0., Dugan-Rocha S., Miner G., 

RA Morgan M. , Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G. 

RA Baden-Tillson H . , Barnstead M. , Chin S., Evans C.A., Ferriera S., 

RA Fosler C, Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., 

RA Pfannkoch CM., Sitter C, Sutton G.G., Venter J.C., Woodage T., 

RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., 

RA Doucette-Stamm L., Weinstock K., Fechtel K . , Weiss R.B., Dunn D.M., 

RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K. 

RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C, Jones S., 

RA Krzywinski M., Mathewson C, Siddiqui A., Wye N., McPherson J., 

RA Zhao S., Fraser CM., Shetty J., Shatsman S., Geer K., Chen Y., 

RA Abramzon S., Nierman W.C, Havlak P.H., Chen R., Durbin K.J., Egan A 

RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., 

RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., 

RA Kalafus K.J., McLeod M.P., Milosavl jevic A., Virk D., Volkov A., 

RA Wheeler D.A., Zhang Z., Bailey J. A., Eichler E.E., Tuzun E., 

RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C, Zdobnov E., 

RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., 

RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., 

RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F . , Mar Alba M. 

RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne 0., 

RA Poliakov A., Huebner N . , Ganten D., Goesele C, Hummel 0., 

RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., 

RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., 

RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., 

RA Pasko D., Tonellato P.J., Twigger S., Ponting CP., Duarte J.M., 

RA Rice S., Goodstadt L., Beatson S.A., Ernes R.D., Winter E.E., 

RA Webber C, Brandt P., Nyakatura C, Adetobi M., Chiaromonte F., 

RA Elnitski L., Eswara P., Hardison R.C, Hou M. , Kolbe D., Makova K., 

RA Miller W. , Nekrutenko A., Riemer C, Schwartz S., Taylor J., Yang S. 

RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M. , Clamp M. , 

RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper CM 

RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., 

RA Bourque C, Lopez-Otin C, Puente X.S., Chakrabarti K., Chatter ji S. 

RA Dewey C, Pachter L., Bray N., Yap V.B., Caspi A., Tesler C, 

RA Pevzner P. A., Haussler D., Roskin K.M., Baertsch R. , Clawson H., 

RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., 

RA Trumbower H., Weirauch M. , Cooper D.N., Stenson P.D., Ma B., Brent M 

RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., 

RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S 

RA Collins F.S. ; 

RT "Genome sequence of the Brown Norway rat yields insights into 

RT mammalian evolution."; 

RL Nature 428:493-521(2004). 

RN [2] 

RP IDENTIFICATION. 

RX PubMed=14981236; DOI=10 . 1073/pnas . 0308746101; 

RA Bera T.K., Das S., Maeda H., Beers R., Wolfgang CD., Kumar V., 
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RA Hahn Y., Lee B., Pastan I.; 

RT "NGEP, a gene encoding a membrane protein detected only in prostate 

RT cancer and normal prostate."; 

RL Proc. Natl. Acad. Sci. U.S.A. 101:3059-3064(2004). 

CC -!- FUNCTION: May play a role in cell-cell interactions (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Cell membrane; Multi-pass membrane protein 
CC (By similarity) . Note=Concentrates at sites of cell-cell contact 

CC (By similarity) . 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AABR03 06 8351; -; NOT_ANNOTATED_CDS ; Genomic_DNA. 

DR EMBL; AABR03 06 9 02 9; -; NOT_ANNOTATED_CDS ; Genomic_DNA. 

DR EMBL; AABR03 0 7 0 76 7 ; -; NOT_ANNOTATED_CDS ; Genomic_DNA. 

DR EMBL; BK004074; DAA04565.1; -; mRNA. 

DR UniGene; Rn. 163084; -. 

DR Ensembl; ENSRNOG00000023427 ; Rattus norvegicus. 

DR RGD; 1302987; Ngep . 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Membrane; Transmembrane. 



FT 


CHAIN 


1 


860 


Transmembrane protein 16G 


FT 








/FTId=PRO_0000289328. 


FT 


T0P0_D0M 


1 


297 


Extracellular (Potential) 


FT 


TRANSMEM 


298 


318 


Potential . 


FT 


TOPO_DOM 


319 


362 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


363 


383 


Potential . 


FT 


TOPO_DOM 


384 


441 


Extracellular (Potential) 


FT 


TRANSMEM 


442 


462 


Potential . 


FT 


TOPO_DOM 


463 


492 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


493 


513 


Potential . 


FT 


TOPO_DOM 


514 


530 


Extracellular (Potential) 


FT 


TRANSMEM 


531 


551 


Potential . 


FT 


TOPO_DOM 


552 


652 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


653 


673 


Potential . 


FT 


TOPO_DOM 


674 


701 


Extracellular (Potential) 


FT 


TRANSMEM 


702 


722 


Potential . 


FT 


TOPO_DOM 


723 


779 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


780 


800 


Potential . 


FT 


TOPO_DOM 


801 


860 


Extracellular (Potential) 


SQ 


SEQUENCE 


860 AA; 


97170 


MW; 96BE3CBD6DE96101 CRC64; 



Query Match 84.2%; Score 3472; DB 1; Length 860; 

Best Local Similarity 85.7%; Pred. No. 1.8e-281; 

Matches 652; Conservative 36; Mismatches 69; Indels 4; Gaps 2; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVL 6 0 

I : I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I : I 

Db 99 QRDVQDEAAAVHYILLSAPWAVLCYYAEDLRLKLPLQELPNQASNWSATLLEWLGIPNIL 158 

Qy 61 LEVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLL 120 
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II III I I I I I I I : I : : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I 

15 9 LENVPDTPPEYYSCQFKASKLQWFLGSDNQDTFFTSTKRHQILFEILAKTPYGHQKKGLF 218 

121 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

II I I I I I I I I I I I I I I I I I I II I I 111111:11111111 I I I I I I I I I 

219 GIDQLLAE GVFSAAFPLHDGPFSVVPESSQVLGLTQRQVLFKHWARWGKWRKYQPLDHVR 2 78 

181 RYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I 11111111:111111 I I : I : I I I I 
2 79 RYFGEKVALYFAWLGFYTGWLLPAAVVGTWFLAGCFLVFSDVPTQELCHSSDTFDMCPL 338 

2 41 CLDCPFWLLSS ACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKS ATLAYRWDCSD 3 0 0 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I 
33 9 CSDCSFWLLSSACTLAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKNATLAYRWDCSD 3 98 

3 01 YEDTEERPRPQFAASAPMTAPNPI TGEDEPYFPERSRARRMLAGS VVI VVMVAVVVMCLV 3 6 0 

III 1111111111:11111 I I I II I I I I I I I I : : I I I I I I I I I ::: I I I I I : I I I 

39 9 YEDIEERPRPQFAATAPMTALNPITGEDEPYFPEKNRVRRMLAGSVVLLMMVAVVIMCLV 458 

361 SI ILYRAIMAI VVSRSGNTLLAAWASRIASLTGS VVNLVFILILSKI YVSLAHVLTRWEM 42 0 

I I I I I I I : I I I : I I : I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I II I I I I I I I 
45 9 SI ILYRAVMAI IVSKSNNAFLSAWASRIASLTGSVVNLVFILILSKVYVILAQVLTRWEM 518 

421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGVRNEECAAGGC 480 

I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
519 HRTQTAFEDAFTLKVFIFQFVNFYASPVYIAFFKGRFVGYPGNYHTLFGVRNEECPAGGC 5 78 

481 LIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDY 540 

I I I I I I I I I I I I I I I : I I I : I I I I : I I I I I III I I : : : I I I II: III II 
57 9 L S E L AQE L L VI MVGKQ 1 1 NNVQE VL VPKLKGCWQK — LCSRRKKAG — MGANPAPWEADY 634 

541 ELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 6 00 

I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
63 5 ELLPCEGLFHEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVA 6 94 

601 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRWTRAHDLRGFLNFTLARAP 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
695 ERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRVYYSWTRAPDLRGFLNFTLARAP 754 

661 SSFAAAHNRTCRYRAFRDDDGHYSQTYWNLLAIRLAFVI VFEHVVFSVGRLLDLLVPDIP 72 0 

: I : I I I I I I I I I I I I I I I I I I I III I I I I I II I I I I I I I I I I I II I I I I I I I I I 
755 PTFTSAHNRTCRYRAFRDDDGHYSPTYWTLLAIRLAFVI VFEHVVFSTGRFLDLLVPDIP 814 

721 ESVEIKVKREYYLAKQALAENEVLFGTNGTKDEQPKGSELS 761 

I I I I I I I I I I I I I I I I I I I : I I I I I I III II I 
815 ESVEIKVKREYYLAKQALADNEALLGATGVKGEQPPSSEPS 855 



RESULT 4 
A6QLE6_BOVIN 

ID A6QLE6_BOVIN Unreviewed; 920 

AC A6QLE6 ; 

DT 21-AUG-2007, integrated into UniProtKB/TrEMBL . 
DT 21-AUG-2007, sequence version 1. 
DT 21-AUG-2007, entry version 1. 
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DE Putative uncharacterized protein. 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; 

OC Pecora; Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=L1 Hereford; TISSUE=Basal ganglia; 

RA Moore S., Alexander L . , Brownstein M., Guan L . , Lobo S., Meng Y. , 

RA Tanaguchi M. , Wang Z., Yu J . , Prange C, Schreiber K., Shenmen C, 

RA Wagner L., Bala M., Barbazuk S., Barber S., Babakaiff R. , Beland J., 

RA Chun E., Del Rio L., Gibson S., Hanson R., Kirkpatrick R., Liu J., 

RA Matsuo C, Mayo M., Santos R.R., Stott J., Tsai M., Wong D., 

RA Siddiqui A., Holt R. , Jones S.J., Marra M.A. ; 

RL Submitted (JUN-2007) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attr ibution-NoDerivs License 

CC 

DR EMBL ; BC147937; AAI47938.1; -; mRNA. 

PE 2: Evidence at transcript level; 

SQ SEQUENCE 920 AA; 107708 MW; 3 AFFC6 31 0F1 86 2CD CRC64; 

Query Match 36.5%; Score 1505.5; DB 2; Length 920; 

Best Local Similarity 40.4%; Pred. No. 1.6e-116; 

Matches 328; Conservative 145; Mismatches 270; Indels 69; Gaps 20; 

Qy 8 NTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNWS AGLLAWLGIP 5 7 

I : : : I I I I I I I I : : : : I : I I : I : II 

Db 122 NSDIIFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFMSRIDKQISRFRRWLPKK 181 

Qy 58 NVLL — EVVPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGH 114 

: I I : I I : : I : I : : I : I : I I I I : I : I : I I : I 
Db 182 PMRLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKDTFFNNATRSRIVHHILQRIKY-E 239 

Qy 115 EKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 174 

I I I : I : : : I I I I I I I I I : I : : : : I I : I : : I I I I I I I I 

Db 240 EGKNKIGLNRLL TNGSYEAAFPLHEGSYRSKNSIRTHGAVNHRHLLYECWASWGVWYKYQ 2 99 

Qy 175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIP TQELCGSKDS 234 

I I I I I I I I I I I : I I I I I I I : I I I I I I I : I I I I I : : : I : I : I 

Db 300 PLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVITLDHCQVSKEVCQATDI 359 

Qy 235 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 293 

I I I : I III I I : I I : I I I : I I I I I : : I I I : I I : I I : I I I : I : I 
Db 360 I-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAVWATVFLEFWKRRRAVIA 418 

Qy 29 4 YRWDCSDYEDTEERPRPQFAAS-APMTAPNPI TGEDEPYFPERSRARRMLAGSVVIVVMV 3 52 

III I : I : I I I I I I I : I I I : I : I I I : I : : : I I : 

Db 419 YDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFADKCSRLIVSASGIFFMI 4 78 

Qy 353 AVVVMCLVS 1 1 LYRAIMAI VVSRS GNTLLA-AWA SRI ASLTGSVV — NLVFILIL 404 

II: : I : : I I : : I I II I : : I : I I : I I I : : I 

Db 479 CWIAAVFGIVIYRWTV S TF AAF KWAL I RNN S Q V AT - T G T A VC INFCIIMLL 530 
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40 5 SKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNY 46 4 

: : I : I : I I I I I : : : : I : : I I I I : I : I I I I I II I I I I I III I : I I I 
531 NVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSSTFYIAFFLGRFTGHPGAY 590 

46 5 HTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKR 523 

I I I I I I I I I : I : : : I I I I I II I : I : : I I : I : : 
591 LRLINRWRLEECHPSGCLIDLCMQMGIIMVLKQTWNNFMELGYPLIQNWWTR RKVRQ 6 47 

52 4 KAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

: I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 

6 4 8 EHGPERKI SFPQWEKDYNLQPMNAYGLFDEYLEMILQFGFTT IFVAAFPLAPLLALLNNI 7 0 7 

58 2 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 6 41 

: I I I I I I III : : I I I : I I I : I I I I I : II I : I : I I : I I I : : I : I I I : I I I 
70 8 IEIRLDAYKFVTQWRRPLASRAKDIGIWYGI LEGIGILSVITNAFVI AITSDFIPRLVYA 76 7 

6 42 W TRAHDLRGFLNFTLA RAPSSFAAAHNRTCRYRAFR 6 77 

: : I : : I : I : I I : : I I I I : I 

76 8 YKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDGSEFSGTPLKYCRYRDYR 82 7 

6 78 DDDGH YSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYY 732 

I I : : I : : I I I I I I : I I I I 1 : I I : I : I : I I : I : : : : : I I I 

82 8 DPPHSLVPYGYTLQFWHVLAARLAFI IVFEHLVFCIKHLISYLIPDLPKDLRDRMRREKY 887 

733 LAKQALAENEVLFGTNGTKDEQPKGSELSSHW 764 

I : : : I I : I : : I : I 

888 LIQEMMYEAELERLQKERKERKKNGKAHHNEW 919 



RESULT 5 
TM16D_HUMAN 

ID TM1 6 D_HUMAN Reviewed; 955 AA. 

AC Q32M45; Q8NAJ0; Q8NB39; Q8NB53; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot . 

DT 06-DEC-2005, sequence version 1. 

DT 21-AUG-2007, entry version 17. 

DE Transmembrane protein 16D. 

GN Name=TMEM16D; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3). 

RC TISSUE=Brain, and Prostate; 

RX PubMed=14702039; DOI = 10 . 1038/ngl285 ; 

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R. , 

RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., 

RA Sekine M. , Obayashi M. , Nishi T., Shibahara T., Tanaka T., Ishii S., 

RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., 

RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., 

RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., 

RA Takahashi M. , Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., 

RA Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M. , Yamazaki M. , 
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RA Ninomiya K., Ishibashi T . , Yamashita H., Murakawa K., Fujimori K., 

RA Tanai H., Kimata M . , Watanabe M. , Hiraoka S., Chiba Y., Ishida S., 

RA Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., 

RA Kanehori K., Takahashi-Fu j ii A., Hara H . , Tanase T . -0 . , Nomura Y . , 

RA Togiya S., Komai F . , Hara R., Takeuchi K., Arita M. , Imose N . , 

RA Musashino K., Yuuki H., Oshima A., Sasaki N . , Aotsuka S., 

RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N . , Sano S., 

RA Moriya S., Momiyama H., Satoh N . , Takami S., Terashima Y., Suzuki 0., 

RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., 

RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M. , Kawakami B., 

RA Yamazaki M . , Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., 

RA Fujimori Y., Komiyama M . , Tashiro H., Tanigami A., Fujiwara T., 

RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., 

RA Kawabata A., Hikiji T . , Kobatake N . , Inagaki H., Ikema Y., Okamoto S., 

RA Okitani R., Kawakami T., Noguchi S., Itoh T . , Shigeta K., Senba T., 

RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., 

RA Togashi T., Oyama M. , Hata H., Watanabe M., Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., 

RA Okumura K., Nagase T., Nomura N., Kikuchi H . , Masuho Y., Yamashita R. , 

RA Nakai K., Yada T . , Nakamura Y., Ohara 0., Isogai T., Sugano S.; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs . " ; 

RL Nat. Genet. 36:40-45(2004). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). 

RX PubMed=15489334; DOI=10 . 1101/gr . 2596504; 

RG The MGC Project Team; 

RT "The status, quality, and expansion of the NIH full-length cDNA 

RT project: the Mammalian Gene Collection (MGC)."; 

RL Genome Res. 14:2121-2127(2004). 

RN [3] 

RP IDENTIFICATION. 

RX PubMed=12739008; 

RA Katoh M. , Katoh M. ; 

RT "FLJ10261 gene, located within the CCND1-EMS1 locus on human 

RT chromosome llql3, encodes the eight-transmembrane protein homologous 

RT to C12orf3, Cllorf25 and FLJ34272 gene products."; 

RL Int. J. Oncol. 22:13 75-13 81(20 03). 

CC -!- SUBCELLULAR LOCATION: Membrane; Multi-pass membrane protein 

CC (Potential). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=l; 

CC IsoId=Q32M45-l; Sequence=Displayed; 

CC Name=2; 

CC IsoId=Q32M45-2; Sequence=VSP_025742 ; 

CC Name=3; 

CC IsoId=Q32M45-3; Sequence=VSP_025741, VSP_025743; 

CC Note=No experimental confirmation available; 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL ; AK091540; BAC03688.1; ALT_INIT; mRNA. 

DR EMBL; AK091591; BAC03704.1; -; mRNA. 
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EMBL; AK092596; BAC03924.1; -; mRNA. 
EMBL; BC109308; AAI09309.1; -; mRNA. 
UniGene; Hs. 58785; -. 

Ensembl; ENSG00000151572; Homo sapiens. 

HGNC; HGNC:23837; TMEM16D. 

MIM; 610111; gene. 

PharmGKB; PA134975112; -. 

ArrayExpress; Q32M45; -. 

InterPro; IPR007632; DUF590. 

PANTHER; PTHR12308; DUF590; 1. 

Pfam; PF04547; DUF590; 1. 

2: Evidence at transcript level; 

Alternative splicing; Coiled coil; Glycoprotein; 
Polymorphism; Transmembrane. 



FT 


CHAIN 


1 


955 




Transmembrane protein 16D. 


FT 










/FTId=PRO_0000288650 . 


FT 


TRANSMEM 


353 


373 




Potential . 


FT 


TRANSMEM 


425 


445 




Potential . 


FT 


TRANSMEM 


506 


526 




Potential . 


FT 


TRANSMEM 


548 


568 




Potential . 


FT 


TRANSMEM 


596 


616 




Potential . 




T RAN S ME M 




736 




Potential . 


FT 


TRANSMEM 


769 


789 




Potential . 


FT 


TRANSMEM 


886 


906 




Potential . 


FT 


COILED 


918 


953 




Potential . 


FT 


CARBOHYD 


824 


824 




N-linked (GlcNAc. . .) (Potential). 


FT 


VAR_SEQ 


1 


433 




Missing (in isoform 3). 


FT 










/FTId=VSP_025741 . 


FT 


VAR_SEQ 


19 


54 




EGGVDLQGYQLDMQILPDGPKSDVDFSEILNAIQEM -: 


FT 










(in isoform 2) . 


FT 










/FTId=VSP_025742 . 


FT 


VAR_SEQ 


466 


512 




Missing (in isoform 3). 


FT 










/FTId=VSP_025743 . 


FT 


VARIANT 


115 


115 




G -> A (in dbSNP:rs34162417) . 


FT 










/FTId=VAR_032453 . 


FT 


CONFLICT 


209 


209 




F -> L (in Ref. 1; BAC03704). 


SQ 


SEQUENCE 


955 AA; 


111462 


MW; 9A9348C61A4F20AF CRC64; 


Query Match 




36. 


.3%; 


Score 1498.5; DB 1; Length 955; 


Best Local Similarity 


40, 


.3%; 


Pred. No. 6.4e-116; 



Matches 327; Conservative 145; Mismatches 271; Indels 69; Gaps 20; 

y 8 NTTVHYALLSASWAVLCYYAEDLRLKLPLQE LPNQASNWS AGLLAWLGIP 5 7 

I : : : I I I I I I I I : : : : I : I I : I : II 

b 157 NSDI IFVKLHAPWEVLGRYAEQMNVRMPFRRKIYYLPRRYKFMSRIDKQISRFRRWLPKK 216 

5 8 NVLL — EVVPDVPP-EYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGH 114 
: I I : I I : : I : I : : I : I : : I I I : I : I : II: I 

217 PMRLDKETLPDLEENDCYTAPFSQQRIHHFI-IHNKETFFNNATRSRIVHHILQRIKY-E 2 74 

115 EKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQ 174 

I I I : I : : : I I I I I I I I I : I : : : I I : I : : I I I I I I I I 

2 75 EGKNKIGLNRLLTNGSYEAAFPLHEGSYRSKNSIRTHGAENHRHLLYECWASWGVWYKYQ 334 

175 PLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDS 234 
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I I I I I I I I I I I : I I I I I I I : I I I I I I I : I I I I I : : : I : I : I 

33 5 PLDLVRRYFGEKIGLYFAWLGWYTGMLFPAAFIGLFVFLYGVTTLDHSQVSKEVCQATDI 3 94 

23 5 FEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLA 2 93 

I I I : I III I I : I I : I I I : I I I I I : : I I I : I I : I I : I I I : I : I 
395 I-MCPVCDKYCPFMRLSDSCVYAKVTHLFDNGATVFFAVFMAVWATVFLEFWKRRRAVIA 453 

29 4 YRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSWIVVMV 3 52 
III I : I : I I I I I I I : I I I : I : I I I : I : : : I I : 

45 4 YDWDLIDWEEEEEEIRPQFEAKYSKKERMNPISGKPEPYQAFTDKCSRLIVSASGIFFMI 513 

353 AVVVMCLVSI ILYRAIMAIVVSRSGNTLLA-AWA SRI ASLTGSVV — NLVFILIL 404 

II: : I : : I I : : I I II I : : I : I I : I I I : : I 

514 CWIAAVFGIVIYRWTV STFAAFKWALIRNNSQVAT-TGTAVCINFCIIMLL 565 

40 5 SKI YVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNY 46 4 

: : I : I : I I I I I : : : : I : : I I I I : I : I I I I I II I I I I I III I : I I I 
56 6 NVLYEKVALLLTNLEQPRTESEWENSFTLKMFLFQFVNLNSSTFYIAFFLGRFTGHPGAY 6 25 

46 5 HTLFG-VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKR 523 

I I I I I I I I I : I : : : I I I I I II I : I : : I I : I : : 
62 6 LRLINRWRLEECHPSGCLIDLCMQMGI IMVLKQTWNNFMELGYPLIQNWWTR RKVRQ 6 82 

52 4 KAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

: I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 

68 3 EHGPERKI SFPQWEKDYNLQPMNAYGLFDEYLEMILQFGFTT IFVAAFPLAPLLALLNNI 7 42 

582 VE I RL DARKF VCE YRRPVAERAQD I GI WFHI LAGLTHLAVI SNAFLLAF S S DFLPRAYYR 6 41 

: I I I I I I III : : I I I : I I I : I I I I I : II I : I : I I : I I I : : I : I I I : I I I 
743 IEIRLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGILSVITNAFVIAITSDFIPRLVYA 8 02 

6 42 W TRAHDLRGFLNFTLA RAPS SFAAAHNRTCRYRAFR 6 77 

: : | : : | : | : I I : : I I I I : I 

803 YKYGPCAGQGEAGQKCMVGYVNASLSVFRISDFENRSEPESDGSEFSGTPLKYCRYRDYR 8 62 

678 DDDGH YSQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYY 732 

I I : : I : : I I I I I I : I I II I : I I : I : I : I I : I : : : : : I I I 

86 3 DPPHSLVPYGYTLQFWHVLAARLAFI IVFEHLVFCIKHLISYLIPDLPKDLRDRMRREKY 9 22 

733 LAKQALAENEVLFGTNGTKDEQPKGSELSSHW 764 

I : : : I I : I : : I : I 

923 LIQEMMYEAELERLQKERKERKKNGKAHHNEW 954 



RESULT 6 
Q1AP36_STRPU 

ID Q1AP36_STRPU Unreviewed; 903 AA. 

AC Q1AP36; 

DT ll-JUL-2006, integrated into UniProtKB/TrEMBL . 

DT ll-JUL-2006, sequence version 1. 

DT 24-JUL-2007, entry version 6. 

DE 122 kDa protein TMEM16 (Fragment) . 

OS Strongylocentrotus purpuratus (Purple sea urchin) . 

OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; 

OC Echinoidea; Euechinoidea; Echinacea; Echinoida; Strongylocentrotidae; 

OC Strongylocentrotus. 
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OX NCBI_TaxID=7668; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Coelomocyte; 

RA Galindo B.E., Moy G.W., Vacquier V.D.; 

RT "A 122 kDa protein from Strongylocentrotus purpuratus embryo belongs 

RT to TMEM16 protein family."; 

RL Submitted (APR-2005) to the EMBL/ GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; DQ020165; AAY82886.1; -; mRNA. 

DR UniGene; Spu. 15325; -. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

FT NON_TER 1 1 

SQ SEQUENCE 903 AA; 104887 MW; 4 43 A4A4A7E5 0 0 74A CRC64; 



Query Match 35.8%; Score 1476.5; DB 2; Length 903; 

Best Local Similarity 40.2%; Pred. No. 4.2e-114; 

Matches 311; Conservative 144; Mismatches 268; Indels 51; Gaps 15; 

Qy 2 QDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLL 61 

: : III : : I I : : I I I : I : : I : I : : I : : I I I 

Db 114 EESHDGRT— FFVKVHAPWDLMTRYAEELKIKMPIEENNMEEPVNVFNCIDKLWTPFELS 171 

Qy 62 EVVPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLG 121 



Db 172 EEYVKPEPDVFTAPFIRDRASEFI-MESQDTFFPNNIRNRWYEILERMRYDANDPAKFG 230 

Qy 12 2 IHQLLAEGVLSAAFPLHDGPFKTP PEGPQAPRLNQRQVLFQHWARWGKWNKYQPLD 177 

I I : I I 11:111:1:1: Ml I I : I :: I I I I : I I I I I I 

Db 231 IDHLI ANGSYFAAYPLHEGDYKSKHSLLTHGPQ NDRHLLYEEWARPGRWYKKQPLD 2 86 

Qy 178 HVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEM 23 7 

: I I I I I I I I I I I I I I I I I I I : I I I I I : I : I I : I I I 

Db 287 LIRRYFGEKIGIYFCWLGFYTEMLTWAGFVGLIVFLYGCISLPSSVVVQEICDGTDII-M 345 

Qy 23 8 CPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRW 2 96 

I I I I I I : I I I : I : : Ml: I I I I : I I M I I : MMM Mil 

Db 346 CPLCDKRCPYWTLSDSCFYSKLTYLFDNEATVFFACFMSLWATMFCEFWKRRQNTIDYDW 405 

Qy 29 7 DCSDYEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVI VVMVAVVV 3 56 

I : I : I I I I : I I I I MM III : I I I MM: 

Db 406 DLFGFEEQEENIRPEFEAKAPDRRVSPITNLTEPYMKFSRKFPRFSASI ASIFFMILLVM 465 

Qy 357 MCLVS I ILYRAIMAIVVSRSGNTLLAAWASRIASLTGSVVNLVFILILSKIYVSLAHVLT 416 

: : : : I : I I : : : : : : : I I I I : I I : : : M M I I : I : I II 

Db 466 AAVMTVIVYRI VVKTAIFAIDQEFISSYASI ITSVTASMISLILIMILQILYERI AVWLT 525 

Qy 417 RWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGV-RNEEC 4 75 

I : I I I : I : : I I : I I M : M MM I I I I I I I I I I M MM I III 
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Db 526 NLELHRTETEYEDSFTFKMYLFAFVNYYSTSFYIAFFKGRLPGTPADYGRVFGIWRQEEC 585 



4 76 AAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGP 535 
I I : I I : : I I I I II I : : I I : I : I I : I : I 

58 6 DPAGCMQELFINI AITMCGKQFFNNFMELAMPVLMNFWRS-RTGRKEEKSGK GRYEQ 6 41 

53 6 WEDDYELVPC — E GLF DE YLEMVLQFGFVT I FVAACPL APLF AL LNNWVE I RLDARKF VC 5 93 

II I : I III 111111:1111 I I I I I I I I I I I I I I I I I I I : I I I I I I : 

6 42 WEQDADLADLGPRGLFKEYLEMVVQFGFSTIFVAAFPLAPLFALLNNLVEVRLDAYKFIS 701 

59 4 EYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPR AYYRWTRAHDLRG 6 50 

: I I I I I : I I I I I I I : I I : : I : I : : I I : : I I : I : I : I I II II 

70 2 QLRRPVAKRAQDIGAWYAILVTVGNLSVLTNALVIAFTSEFIPRQVFKYYYGGPEATLNG 761 

651 FLNFTLA RAPSSFAAAHNRT CRYRAFRDDDGHYSQT 686 

: I : : I : II I I I I I I : I I : I 

76 2 YTNWSLSYFNTVDMQNDSKPTDPSYPRVGDEDTTDPNYGLNVSVCRYRGNYDE— HYNVT 819 

68 7 — YWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQAL 73 8 

I I : : I I : I I I : : : : I I I : : : I I : I I I : : : I I I I I : I I I 

82 0 LDYWLVIAIKLAFILLYEHFVLFTKFFVAYI IPDMPEFVKNQIKRETYLGQQAL 8 73 



RESULT 7 
Q1AP35_STRPU 

ID Q1AP35_STRPU Unreviewed; 921 AA. 

AC Q1AP35; 

DT ll-JUL-2006, integrated into UniProtKB/TrEMBL . 

DT ll-JUL-2006, sequence version 1. 

DT 24-JUL-2007, entry version 6. 

DE 122 kDa protein TMEM16 . 

OS Strongylocentrotus purpuratus (Purple sea urchin) . 

OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; 

OC Echinoidea; Euechinoidea; Echinacea; Echinoida; Strongylocentrotidae; 

OC Strongylocentrotus. 

OX NCBI_TaxID=7668; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Testis; 

RA Galindo B.E., Vacquier V.D.; 

RT "Phylogeny of the TMEM16 Protein Family: Some Are Over Expressed in 

RT Cancer."; 

RL Submitted (APR-2005) to the EMBL/ GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attr ibution-NoDerivs License 

CC 

DR EMBL; DQ020166; AAY82887.1; -; mRNA. 

DR UniGene; Spu. 15325; -. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

SQ SEQUENCE 921 AA; 106711 MW; C3F912A32528F059 CRC64; 

Query Match 35.8%; Score 1475.5; DB 2; Length 921; 
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Best Local Similarity 40.4%; Pred. No. 5.2e-114; 

Matches 311; Conservative 142; Mismatches 266; Indels 51; Gaps 15; 

Qy 6 DGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVP 6 5 

III : : I I : : I I I : I : : I : I : : I : : I I II 

Db 136 DGRT — FFVKVHAPWDLMTRYAEELKIKMPIEENNMEEPVNVFNCIDKLWTPFELSEEYV 193 

Qy 66 DVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQL 125 

I : : : I : : I : : : I I I I I : I : : : : : I I I : I II I 

Db 194 KPEPDVFTAPFIRDRASEFI-MESQDTFFPNNIRNRWYEILERMRYDANDPAKFGIDHL 252 

Qy 12 6 LAE GVLSAAFPLHDGPFKTP PEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVRR 181 

: I I | | : | | | : | : | : | | | I I : I :: I I I I : I I I I I I : I I 

Db 253 IANGSYFAAYPLHEGDYKSKHSLLTHGPQ NDRHLLYEEWARPGRWYKKQPLDLIRR 3 08 

Qy 182 YFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLC 241 

I I I I I :: I I I I I I I I I I 11:11111 : I : I I : I I I I I I I 
Db 309 YFGEKIGI YFCWLGFYTEMLTWAGFVGLIVFLYGCI SLPSSVVVQEICAGTDI I-MCPLC 367 

Qy 242 -LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I I : I I I : I : : III: I I I I : I I : I I I : I : I I I : I : I I I 
Db 36 8 DKRCPYWTLSDSCFYSKLTYLFDNEATVFFACFMSLWATMFCEFWKRRQNTIDYDWDLFG 42 7 

Qy 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSWIWMVAWVMCLV 360 

: I : I I I I : I I I I : I I I III : I I I I : : I : : : 

Db 428 FEEQEENIRPEFEAKAPDRRVSPITNLTEPYMKFSRKFPRFSASIAS IFFMILLVMAAVM 487 

Qy 361 SIILYRAIMAIVVSRSGNTLLAAWASRIASLTGSWNLVFILILSKIYVSLAHVLTRWEM 420 

: : I : I I : : : : : : : I I I I : I I : : : I : I : I I : I : I II I : 

Db 488 TVIVYRIVVKTAIFAIDQEFISSYASIITSVTASMISLILIMILQILYERIAVRLTNLEL 547 

Qy 421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGV-RNEECAAGG 4 79 

I I I : I : : I I : I I I : : : I I I I : I I : I I I I I I I I I I : I : I I : I III I 
Db 548 HRTETEYEDSFTFKMYLFAFVNYYSTSFYIAFFKGRLPGTPADYGRVFGIWRQEECDPAG 607 

Qy 480 CLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDD 539 

I : I I : : I I I I II I : : I I : I : I I : I : I III 
Db 608 CMQELFINIAITMCGKQFFNNFMELAMPVLMNFWRS-RTGRKEEKSGK GRYEQWEQD 663 

Qy 540 YELVPC— EGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRR 597 

: I III 111111:1111 I I I I I I I I I I I I I I I I I I I : I I I I I I : : II 

Db 664 ADLADLGPRGLFKEYLEMVVQFGFSTIFVAAFPLAPLFALLNNLVEVRLDAYKFI SQLRR 723 

Qy 598 PVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPR AYYRWTRAHDLRGFLNF 654 

I I I : I I I I I I I : I I : : I : I : : I I : : I I : I : I : I I II I I : I : 

Db 724 PVAKRAQDIGAWYAILVTVGNLSVLTNALVIAFTSEFIPRQVFKYYYGGPEATLNGYTNW 783 

Qy 655 TLA RAP S S F AAAHNRT CRYRAFRDDDGHYSQT — YW 688 

: I : II I I I I I I : I I : I I I 

Db 78 4 SLSYFNTVDMQNDSKPTDPSYPRVGDEDTTDPNYGLNVSVCRYRGNYDE— HYNVTLDYW 8 41 

Qy 689 NLLAIRLAFVI VFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQAL 738 

: : I I : I I I : : : : I I I : : : I I : I I I : : : I I I II : I I I 

Db 842 LVIAIKLAFILLYEHFVLFTKFFVAYIIPDMPEFVKNQIKRETYLGQQAL 891 
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RESULT 8 
A2AHE5_MOUSE 

ID A2AHE5_MOUSE Unreviewed; 981 AA. 

AC A2AHE5; 

DT 20-FEB-2007, integrated into UniProtKB/TrEMBL . 

DT 20-FEB-2007, sequence version 1. 

DT 21-AUG-2007, entry version 6. 

DE Novel protein (B230324K02Rik) . 

GN Name=RP23-300M9.6; ORFNames=RP23-300M9 . 6-002 ; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Glires; Rodentia; Sciurognathi ; 

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RA Smith M. ; 

RL Submitted (JUL-2007) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RA Kay M. ; 

RL Submitted (JUL-2007) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP NUCLEOTIDE SEQUENCE. 

RA Phillimore B . ; 

RL Submitted (JUL-2007) to the EMBL/ GenBank/DDBJ databases. 

cc 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AL731700; CAM18325.1; -; Genomic_DNA. 

DR EMBL; AL731779; CAM18325.1; JOINED; Genomic_DNA. 

DR EMBL; BX005257; CAM18325.1; JOINED; Genomic_DNA. 

DR EMBL; AL731779; CAM25697.1; -; Genomic_DNA. 

DR EMBL; AL731700; CAM25697.1; JOINED; Genomic_DNA. 

DR EMBL; BX005257; CAM25697.1; JOINED; Genomic_DNA. 

DR EMBL; BX005257; CAM27817.1; -; Genomic_DNA. 

DR EMBL; AL731700; CAM27817.1; JOINED; Genomic_DNA. 

DR EMBL; AL731779; CAM27817.1; JOINED; Genomic_DNA. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 4: Predicted; 

SQ SEQUENCE 981 AA; 114568 MW; E851D80BC49F4977 CRC64; 

Query Match 35.7%; Score 1472; DB 2; Length 981; 

Best Local Similarity 39.7%; Pred. No. l.le-113; 

Matches 317; Conservative 156; Mismatches 264; Indels 62; Gaps 22; 

Qy 8 NTTVHYALLSASWAVLCYYAEDLRLKLPLQ ELPNQASNWSAGLLAWLGIP 5 7 

I : : : I I I I I I I : : : I : : : : I : : | : 

Db 202 NPDIMFIKIHIPWDTLCKYAERLNIRVPFRKKCYYTDQKNKSKSRVQNYFKRIKKWMSQN 261 

Qy 58 NVLLE — VVPDV-PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — 112 

: : I : I : : : I : I : : I : : I : I I I I : : I : I : : : I : I I 

http://es/ScoreAccessWeb/GetItem.action?AppId=10552...0-552-5^^ 



SCORE Search Results Details for Application 105525 15 and Search Result 200S0624_135830_us-10-552-515-l_copy_157_933.rap. 



26 2 PMVLDKSAFPELEESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRI VYHMLERTKYEN 3 2 0 

113 GHEKKNLLGIHQLLAE GVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWG 16 8 

I I : I I : I : I I I I I I : I : I : III I I : I : : I I I I I 

321 GISK VGIRKLITNGSYIAAFPPHEGAYKSSLPIKTHGPQ NNRHLLYE RWARWG 3 73 

16 9 KWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQEL 228 

I I : I I I I : I I I I I I : I I I I I I I : I I I I : I I I I I I II I : : I I : 

3 74 MWYKHQPLDLIRMYFGEKIGLYFAWLGWYTGMLIPAAVVGLCVFFYGLVTMNESQVSQEI 433 

22 9 CGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKR 2 8 7 

I :: I I I I I I : I I : : I I : I I I : I I I I I I : : I I I : I I : I I : I I I 

43 4 CKATEVF-MCPLCDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKR 4 92 

28 8 KSATLAYRWDCSDYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGSV 3 46 

: : I I I I : : I : I I I I I I I I I I I I I : I I : I : I : I 
49 3 RRSILTYTWDLIEWEEEEETLRPQFEAKYYRMEVINPITGKPEPHQPSSDKVTRLLVSVS 552 

34 7 VIVVMVAVVVMCLVSIILYR-AIMAIVVSRSGNTLLAAWASRIASLTGSV-VNLVFILIL 404 

I I : : : I : : : : : : I I : I I I : I : I : : I : I : I : : I 

553 GIFFMISLVITAVFAVWYRLWMEQFASFKWNFVKQHW— QFATSGAAVCINFI I IMLL 610 

40 5 SKI YVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNY 46 4 

: I : I : : I I I I I : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : I I I 
611 NLAYEKIAYLLTNLEYPRTESEWENSFALKMFLFQFVNLNSS IFYIAFFLGRFVGHPGKY 6 70 

465 HTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKR 523 

: I I I I I I I I I I : I : : III I I : I I I : I : : I I : : : 
671 NKLFERWRLEECHPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI 725 

52 4 KAGASAGASQGPWEDDYELVP — CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

II II I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

72 6 KRGIQ-DASIPQWENDWNLQPMNIHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNI 784 

58 2 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 6 41 

: I I I I I I III : : I I I : II I I I I I II I : I I I I : I I I : : I : I I : : I I I 
78 5 IEIRLDAYKFVTQWRRPLPARATDIGIWLGILEGIGILAVITNAFVI AITSDYIPRFVYE 8 44 

642 W TRAHDLRGFLNFTLARAP-SSFAAAHNRTCRYRAFR DDDGHYSQ 685 

: : | : | : : | : | : | : I I I I : I :: 

8 45 YKYGPCANHVKQNENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTL 9 04 

686 TYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEVLF 7 45 

I I : : I I I I I I : I I I I I : I I : : I : I I I I : : : : : I I II : : : I I : 
905 QYWHILAARLAFI IVFEHLVFGIKSFIAYLIPDIPKGLRERIRREKYLVQEMMYEAEL — 9 62 

746 GTNGTKDEQPKGSELSSHW 764 

: : : I : I 
963 -EHLQQQRRKSGQPIHHEW 980 



RESULT 9 
TM1 6 C_HUMAN 

ID TM1 6 C_HUMAN Reviewed; 981 AA. 

AC Q9BYT9; 
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DT 16-JAN-2004, integrated into UniProtKB/Swiss-Prot . 

DT 01-JUN-2001, sequence version 1. 

DT 24-JUL-2007, entry version 31. 

DE Transmembrane protein 16C. 

GN Name=TMEM16C; Synonyms=Cl lorf 25; ORFNames=GENX-3 947; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [ MRNA ] . 

RA Rosier M.F., Toselli E., Segurens-Soury B., Auffray C, Devignes M.D.; 

RT "Predominant brain expression and full-length characterization of a 

RT novel human 6.6-Kb transcript mapping at llpl4 in the telomeric part 

RT of WAGR locus. "; 

RL Submitted (NOV-2000) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SUBCELLULAR LOCATION: Membrane; Multi-pass membrane protein 

CC (Probable) . 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attr ibution-NoDer ivs License 

CC 

DR EMBL; AJ300461; CAC32454.1; -; mRNA. 

DR UniGene; Hs. 91791; -. 

DR Ensembl; ENSG00000134343 ; Homo sapiens. 

DR KEGG; hsa:63982; -. 

DR HGNC; HGNC:14004; TMEM16C . 

DR MIM; 610110; gene. 

DR PharmGKB; PA2 5 48 9; -. 

DR ArrayExpress ; Q9BYT9; -. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Membrane; Transmembrane. 



FT 


CHAIN 


1 


981 


Transmembrane protein 16C 


FT 








/FTId=PRO_0000 0 72565. 


FT 


TRANSMEM 


398 


420 


Potential . 


FT 


TRANSMEM 


471 


490 


Potential . 


FT 


TRANSMEM 


553 


575 


Potential . 


FT 


TRANSMEM 


590 


612 


Potential . 


FT 


TRANSMEM 


642 


664 


Potential . 


FT 


TRANSMEM 


759 


781 


Potential . 


FT 


TRANSMEM 


809 


831 


Potential . 


FT 


TRANSMEM 


904 


926 


Potential . 


SQ 


SEQUENCE 


9 81 AA; 


114655 


MW; 15A3276420912393 CRC64, 



Query Match 35.6%; Score 1467.5; DB 1; Length 981; 

Best Local Similarity 40.9%; Pred. No. 2.6e-113; 

Matches 313; Conservative 149; Mismatches 245; Indels 59; Gaps 21; 

Qy 2 0 WAVLCYYAEDLRLKLPLQ ELPNQASNWSAGLLAWLGIPNVLLE— VVPDV 6 7 

I I I I I I I : : : I : : : : : I : : : I : I I : 

Db 214 WDTLCKYAERLNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKDWMAQNPMVLDKSAFPDL 2 73 
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6 8 -PPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPY — GHEKKNLLGIHQ 124 
: I : I : : I : : I : I I I I : : I : I : : : I : I I I I : I I : 
2 74 EESDCYTGPFSRARIHHFI-INNKDTFFSNATRSRIVYHMLERTKYENGISK VGIRK 329 

12 5 LLAEGVLSAAFPLHDGPFKT PPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 

I : I | | | | | : | : | : I I I : I :: I I I I I I I : I I I I : I 

33 0 LINNGSYI AAFPPHEGAYKSSQPIKTHGPQ NNRH L L YE RW ARW GMWYKHQPLDLIR 3 85 

181 RYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPL 2 40 

I I I I I : I I I I I I I : I I I I : I II : I I II I I : : : I I : I : : I I I I I 
386 LYFGEKIGLYFAWLGWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICKATEVF-MCPL 4 44 

2 41 C-LDCPFWLLS SACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCS 2 9 9 

I : I I : : I I : I I I : I I I I I I : : I I I : I I : I I : I I I : : I I I I 

445 CDKNCSLQRLNDSCIYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRRSILTYTWDLI 504 

30 0 DYEDTEERPRPQFAAS-APMTAPNPITGEDEPYFPERSRARRMLAGS VVI VVMVAVVVMC 3 58 

: : I : I I I I I I I I I I I I I : I I : I : I : I I I : : : I : 

50 5 EWEEEEETLRPQFEAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGIFFMISLVITA 56 4 

35 9 LVSIILYR-AIMAIWSRSGNTLLAAWASRIASLTGSV-VNLVFILILSKIYVSLAHVLT 416 

: : : : I I : I I I : I : I : : I : I : I : : I : I : I : : I I 

56 5 VF G VVVYRL VVME QF ASFKWNF I KQ YW — QFATSAAAVCINFI I IMLLNLAYEKI AYLLT 6 22 

417 RWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFG-VRNEEC 4 75 

I I I : : : : I : : I I I : I : I I I I I II I I I I I I I I I I : I I I : I I I I I I 
62 3 NLEYPRTESEWENSFALKMFLFQFVNLNSSIFYIAFFLGRFVGHPGKYNKLFDRWRLEEC 6 82 

4 76 AAGGC LIE L AQEL L VI MVGKQVINNMQEVL I PKLKGWWQKFRLRSKKRKAGAS AGAS QGP 535 

I I I I : I : : I I I I I : I I I : I : : I I : : : II II 

683 HPSGCLIDLCLQMGVIMFLKQIWNNFMELGYPLIQNWWSRHKI KRGIH-DASIPQ 736 

53 6 WEDDYELVP— CEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVC 5 93 

I I : I : I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I III 

73 7 WENDWNLQPMNLHGLMDEYLEMVLQFGFTTIFVAAFPLAPLLALLNNI IEIRLDAYKFVT 796 

59 4 EYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYRW T 6 43 

: : I I I : II I I I I I II I : I I I I : I I I : : I : I I : : I I I : 
79 7 QWRRPLPARATDIGIWLGILEGIGILAVITNAFVIAITSDYIPRFVYEYKYGPCANHVEP 856 

644 RAH DLRGFLNFTL ARAP - S S F AAAHNRT C RYRAF R DDDGHYSQTYWNLLAIRLAF 697 

: I : I : : I : I : I : III: : : I I :: I I I I I I 

85 7 SENCLKGYVNNSLSFFDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQYWHILAARLAF 916 

698 VIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLAKQALAENEV 743 

: I I I I I : I I : : I : I I : I : : : : : I I I I : : : I I : 
917 IIVFEHLVFGIKSFIAYLIPDVPKGLHDRIRREKYLVQEMMYEAEL 962 



RESULT 10 
A1A5Z4_DANRE 

ID A1A5Z4_DANRE Unreviewed; 900 AA. 

AC A1A5Z4; 

DT 23-JAN-2007, integrated into UniProtKB/TrEMBL . 
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DT 23-JAN-2007, sequence version 1. 

DT 24-JUL-2007, entry version 6. 

DE Zgc:158430. 

GN ORFNames=zgc: 158430; 

OS Danio rerio (Zebrafish) (Brachydanio rerio) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Kidney; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F . , 

RA Diatchenko L . , Marusina K., Farmer A. A., Rubin G.M., Hong L . , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Kidney; 

RG NIH MGC Project; 

RL Submitted (DEC-2006) to the EMBL/ GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC128870; AAI28871.1; -; mRNA. 

DR UniGene; Dr. 108272; -. 

DR ZFIN; ZDB-GENE-061215-108 ; zgc:158430. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

SQ SEQUENCE 900 AA; 105263 MW; DB8 656 7CCE3D1 53E CRC64; 

Query Match 35.5%; Score 1463; DB 2; Length 900; 
Best Local Similarity 39.0%; Pred. No. 5.6e-113; 

Matches 308; Conservative 159; Mismatches 244; Indels 78; Gaps 18; 

Qy 12 HYALLSASWAVLCYYAEDLRLKLPL QELPNQASNWSAGLLAWLGIPNVLLEV 6 3 
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II : I I I I I I : I : : I : I : I : I III I II 

9 8 HYLKIHAPWEVLATYADVLKIKVPFKVSDIPKAREVP LEWLSHPFRLPEN 147 

6 4 VPDVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHE KKNLL 120 

: I : I : : I : I : II I : : I I I I : I : : I : : I I : I I I : I : 

148 IMRPEPDYFTAPFDKSKVDFFL-IDDKDTFFPPSTRNRI VYYILTRCPYYKEDRKEKDKT 2 06 

121 GIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDHVR 180 
I I : : I I I : : I : I I I I : : I : : I I : : : I I I : : I I I I : : : 

2 0 7 GINRLLNNGTYTS AYPLHDCRYWKKAQDMQCE — SERYHLYRYWARFLCFYKEQPLNLIK 2 6 4 

181 RYFGEKVALYFAWLGF YTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSK — DSFEMC 2 3 8 

: I : I I I :: I I I I I I I I I I I I I : I : I : I : I : : I : I I II 

26 5 KYYGEKIGI YFAWLGFYTEMLFYAAVMGVICFVYGVLSYEDNITSKEICDPKIGGMI VMC 324 

23 9 PLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWD 297 

III I : I I : I I : I I I : I I I I I : : I I : I I I I : I I I : I I I I I 

32 5 PLCDKKCSYWKLNSTCLSSWQSHLFDNEGTVFFAMFMGIWVTLFLEFWKRRQARLEYEWD 3 84 

29 8 CSDYEDTEE — RPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVI VVMVAVV 3 55 

I : I : : : : II:: I I I I I I I I I : I :: : : 

385 LVDFEEEQQQLQIRPEYEQKCTGRRLNRITQEMEPYLPFPSKCARFCLSGATVLFWTCLI 4 44 

3 56 VMCLVSIILYR AIMAIVVSRSGNTLLAAWASRI ASLTGSWNLVFILILSK 4 06 

I I : : : I I I I I I : I : : II I : I I : I I I I I I : 

44 5 VACIMGVIAYRLAVYAAFASVMKDSSTSKIQLVGSLITPQLATSVTASCINFVIILILNF 504 

40 7 IYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHT 466 

: I : I : I I : : I : : I : I : I : I : I I I I I : I I I I : I I I I I : I I I I I I I I 
505 LYEHVAIWITDMEIPKTHLEYENKLTMKMFMFQFVNYYSSCFYVAFFKGKFVGYPGNYSY 564 

46 7 LFG VRNEECAAGGCLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKK 522 

: I I : I I I I I I I I I I II I : I I : : I III:: I : I I I : I : : II 
56 5 MFGKWSTLRNEECAPGGCLIELTTQLLIVMAGKQMVGNVQEALLPLVRNWW SS 617 

52 3 RKAGASAGASQGPWEDDYELVPCE — GLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNN 580 

II : : : I I I : : I I I I I I I I I I : I I I I : I : I I I : I I I I I I I I I 

618 RKGRSHPESTYSRWEQDHDLQNFSQFGLFYEYLEMVIQFGFITLFVASFPLAPLLALFNN 6 77 

581 WVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVI SNAFLLAFSSDFLPR 63 7 

: I : I : I I II : : I I I : I : I : : I I I II : : : I : : I I I : : I I : I I : I I 
6 7 8 I LE VRVDAWKF TTQFRRPMAAKARNI GAWEE I LNWAIMS VVTNAF I MAFT S DMI PRLVY 73 7 

63 8 — AYYRWTRAHDLRGFLNFTLA RAP S S F AAAHNRT C RYR AF RD D 6 79 

II: | : : | : : : | : III:: III: 

73 8 LYAYHPGIEA-NMTGYITNSLSI YNISQIPEDNLPEAGENPSWFNSSTITTCRYRDYRYP 7 96 

68 0 DGH YSQT — YWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYLA 73 4 

II I : I : I : : I I : I I I : I : I I I I I I : I : I I : I I : : : I I I : I 
79 7 PGHLRQYTHTMQFWHILAAKLAFIIIMEHWFWKFFVAWLIPDVPSEVKARIKRERFLV 856 

73 5 KQALAENEV 7 43 

: : I II 
85 7 QEYLHNYEV 8 65 
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RESULT 11 
Q1AP3 7_STRPU 

ID Q1AP37_STRPU Unreviewed; 921 AA. 

AC Q1AP37; 

DT ll-JUL-2006, integrated into UniProtKB/TrEMBL . 

DT ll-JUL-2006, sequence version 1. 

DT 24-JUL-2007, entry version 6. 

DE 122 kDa protein TMEM16 . 

OS Strongylocentrotus purpuratus (Purple sea urchin) . 

OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; 

OC Echinoidea; Euechinoidea; Echinacea; Echinoida; Strongylocentrotidae; 

OC Strongylocentrotus. 

OX NCBI_TaxID=7668; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RA Galindo B.E., Moy G.W., Vacquier V.D.; 

RT "A 122 kDa protein from Strongylocentrotus purpuratus embryo belongs 

RT to TMEM16 protein family."; 

RL Submitted (APR-2005) to the EMBL/ GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attr ibution-NoDer ivs License 

CC 

DR EMBL; DQ020164; AAY828 85.1; -; mRNA. 

DR UniGene; Spu. 15325; -. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

SQ SEQUENCE 921 AA; 106699 MW; 9A516A5DCA1ADCCC CRC64; 

Query Match 35.4%; Score 1460.5; DB 2; Length 921; 

Best Local Similarity 40.1%; Pred. No. 9.4e-113; 

Matches 309; Conservative 140; Mismatches 270; Indels 51; Gaps 15; 

Qy 6 DGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLEVVP 6 5 

III : : I I : : I I I : I : : I : I : : I : : I I II 

Db 136 DGRT — FFVKVHAPWDLMTRYAEELKIKMPIEENNMEEPVNVFNCIDKLWTPFELSEEYV 193 

Qy 66 DVPPEYYSCRFRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTPYGHEKKNLLGIHQL 125 

I : : : I : : I : : : I I I I I : I : : : : : I I I : I II I 

Db 194 KPEPDVFTAPFIRDRASEFI-MESQDTFFPNNIRNRWYEILERMRYDANDPTKFGIDHL 252 

Qy 12 6 LAE GVLSAAFPLHDGPFKTP PEGPQAPRLNQRQVLFQHWARWGKWNKYQPLDH VRR 181 

: I I | | : | | | : | : | : I I I I I : I : : I I I I : I I I I I I : I 

Db 253 IANGSYFAAYPLHEGDYKSKHSLLTHGPQ NDRHLLYEEWARSGRWYKKQPLDLIRC 308 

Qy 182 YFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKDSFEMCPLC 241 

I I I I I :: I I I I I I I I I I 11:11111 : I : I I : I I I I I I I 
Db 309 YFGEKIGI YFCWLGFYTEMLTWAGFVGLIVFLYGCI SLPSSVVVQEICAGTDI I-MCPLC 367 

Qy 242 -LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATLAYRWDCSD 300 

I I : I I I : I : : III: I I I I : I I : I I I : I : I I I : I : I I I 
Db 36 8 DKRCPYWTLSDSCFYSKLTYLFDNEATVFFACFMSLWATMFCEFWKRRQNTIDYDWDLFG 42 7 
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Qy 301 YEDTEERPRPQFAASAPMTAPNPITGEDEPYFPERSRARRMLAGSVVIVVMVAVVVMCLV 360 



Db 428 FEEQEENIRPEFEAKAPDRRVSPITNLPEQYMKFSKRFPRFSTSVPTIAFMILLVMAAVM 487 

Qy 361 SI ILYRAIMAI VVSRSGNTLLAAWASRIASLTGSVVNLVFILILSKI YVSLAHVLTRWEM 420 

: : I : I I : : : : : : : I I I I : I I : : : I : I : I I : I : I II I : 

Db 488 TVI VYRIVVKTAIFAIDQEFI SSYASIITSVTASMI SLILIMILQILYERIAVWLTNLEL 547 

Qy 421 HRTQTKFEDAFTLKVFIFQFVNFYSSPVYIAFFKGRFVGYPGNYHTLFGV-RNEECAAGG 4 79 

I I I : I : : I I : I I I : : : I I I I : I I : I I I I I I I I I I : I : I I : I I I I I 

Db 54 8 HRTETEYEDSFTFKMYLFAFVNYYSTSFYIAFFKGRLPGTPADYGRVFGIWRQEECDPAG 607 

Qy 480 CLIELAQELLVIMVGKQVINNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDD 539 

I II : : | | | | || I : : I I : I : I I : I : I III 

Db 608 CTQELFINIAITMCGKQFFNNFMELAMPVLMNFWRS-RTGRKEEKSGK GRYEQWEQD 663 

Qy 540 YELVPC— EGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRR 597 

: I III 111111:1111 I I I I I I I I I I I I I I I I I I I : I I I I I I : : II 

Db 664 ADLADLGPRGLFKEYLEMVVQFGFSTIFVAAFPLAPLFALLNNLVEVRLDAYKFI SQLRR 723 

Qy 598 PVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPR AYYRWTRAHDLRGFLNF 654 



Db 724 PVAKRAQDIGAWYAILVTVGNLSVLTNALVIAFTSEFIPRQVFKYYYGGPEATLNGYTNW 783 

Qy 655 TLA RAP S S F AAAHNRT CRYRAFRDDDGHYSQT — YW 688 

: I : II I I I I I I : I I : I I I 

Db 78 4 SLSYFNTVDMQNDSKPTDPSYPRVGDEDTTDPNYGLNVSVCRYRGNYDE — HYNVTLDYW 8 41 

Qy 689 NLLAIRLAFVIVFEHVVFSVGRLLDLLVPDIPESVEIKVKREYYLAKQAL 738 

: : I I : I I I : : : : I I I : : : I I : I I I : : : I I I I I : I I I 

Db 842 LVIAIKLAFILLYEHFVLFTKFFVAYIIPDMPEFVKNQIKRETYLGQQAL 891 

RESULT 12 
TM16A_MOUSE 



ID TM16A_MOUSE Reviewed; 960 AA. 

AC Q8BHY3; Q6P5C6; Q8BI26; Q99JK1; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot . 

DT 29-MAY-2007, sequence version 2. 

DT 24-JUL-2007, entry version 32. 

DE Transmembrane protein 16A. 

GN Name=Tmeml6a; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Glires; Rodentia; Sciurognathi ; 

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). 

RC STRAIN=C57BL/6J; TISSUE=Head, and Kidney; 

RX PubMed=16141072; DOI=10 . 1126/science. 1112014; 

RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., 

RA Oyama R., Ravasi T., Lenhard B., Wells C, Kodzius R. , Shimokawa K., 

RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., 

RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E., 
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RA Ambesi-Impiombato A., Apweiler R. , Aturaliya R.N., Bailey T.L., 

RA Bansal M. , Baxter L . , Beisel K.W., Bersano T., Bono H., Chalk A.M., 

RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., 

RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Delia Gatta G . , 

RA di Bernardo D., Down T . , Engstrom P., Fagiolini M . , Faulkner G., 

RA Fletcher C.F., Fukushima T., Furuno M. , Futaki S., Gariboldi M . , 

RA Georgii-Hemming P., Gingeras T.R., Gojobori T . , Green R.E., 

RA Gustincich S., Harbers M. , Hayashi Y., Hensch T.K., Hirokawa N . , 

RA Hill D., Huminiecki L., Iacono M. , Ikeo K . , Iwama A., Ishikawa T., 

RA Jakt M. , Kanapin A., Katoh M. , Kawasawa Y., Kelso J., Kitamura H., 

RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., 

RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L . , Liu J., 

RA Liuni S., McWilliam S., Madan Babu M . , Madera M. , Marchionni L . , 

RA Matsuda H., Matsuzawa S., Miki H., Mignone F . , Miyake S., Morris K., 

RA Mottagui-Tabar S., Mulder N . , Nakano N . , Nakauchi H., Ng P . , 

RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F . , Ohara 0., 

RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G. , Pesole G., 

RA Petrovsky N. , Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., 

RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C, 

RA Schoenbach C, Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., 

RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., 

RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y. , Taki K., 

RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., 

RA Ueda H.R., van Nimwegen E., Verardo R. , Wei C.L., Yagi K., 

RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C, 

RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., 

RA Wahlestedt C, Mattick J.S., Hume D.A., Kai C, Sasaki D., Tomaru Y., 

RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., 

RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., 

RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., 

RA Nishio T., Okada M., Plessy C, Shibata K. , Shiraki T., Suzuki S., 

RA Tagami M. , Waki K., Watahiki A., Okamura-Oho Y. , Suzuki H., Kawai J., 

RA Hayashizaki Y. ; 

RT "The transcriptional landscape of the mammalian genome."; 

RL Science 309:1559-1563(2005) . 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). 

RC STRAIN=C57BL/6, and Czech II; TISSUE=Eye, and Mammary tumor; 

RX PubMed=15489334; DOI=10 . 1101/gr . 2596504; 

RG The MGC Project Team; 

RT "The status, quality, and expansion of the NIH full-length cDNA 

RT project: the Mammalian Gene Collection (MGC)."; 

RL Genome Res. 14:2121-2127(2004). 

CC -!- SUBCELLULAR LOCATION: Cell membrane; Multi-pass membrane protein 
CC (By similarity) . Cytoplasm (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=Q8BHY3-l; Sequence=Displayed; 

CC Name=2; 

CC IsoId=Q8BHY3-2; Sequence=VSP_0256 72 ; 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC -!- SEQUENCE CAUTION: 

CC Sequence=BAC35051 . 1 ; Type=Miscellaneous discrepancy; Note=Wrong choice of frame; 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 
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EMBL; 
EMBL; 



AK028991; BAC26230.1; ALT_INIT; 
AK029329; BAC26398.1; ALT_INIT; 



mRNA. 
mRNA. 



DR EMBL; AK052589; BAC35051.1; ALT_SEQ; mRNA. 

DR EMBL; BC006062; AAH06062.1; ALT_INIT; mRNA. 

DR EMBL; BC062959; AAH62959.1; -; mRNA. 

DR UniGene; Mm. 26700; -. 

DR Ensembl; ENSMUSGO 000 00310 75 ; Mus musculus. 

DR MGI; MGI:2142149; Tmeml6a. 

DR ArrayExpress; Q8BHY3 ; -. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Alternative splicing; Cytoplasm; Glycoprotein; Membrane; 

KW Transmembrane. 



FT 


CHAIN 


1 


960 


Transmembrane protein 16A. 


FT 








/FTId=PRO_0000288436 . 


FT 


TOPO_DOM 


1 


333 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


334 


354 


Potential . 


FT 


TOPO_DOM 


355 


402 


Extracellular (Potential). 


FT 


TRANSMEM 


403 


423 


Potential . 


FT 


TOPO_DOM 


424 


493 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


494 


514 


Potential . 


FT 


TOPO_DOM 


515 


539 


Extracellular (Potential) . 


FT 


TRANSMEM 


540 


560 


Potential . 


FT 


TOPO_DOM 


561 


581 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


582 


602 


Potential . 


FT 


TOPO_DOM 


603 


705 


Extracellular (Potential) . 


FT 


TRANSMEM 


706 


726 


Potential . 


FT 


TOPO_DOM 


727 


765 


Cytoplasmic (Potential). 


FT 


TRANSMEM 


766 


786 


Potential . 


FT 


TOPO_DOM 


787 


856 


Extracellular (Potential). 


FT 


TRANSMEM 


857 


877 


Potential . 


FT 


TOPO_DOM 


878 


960 


Cytoplasmic (Potential). 


FT 


CARBOHYD 


806 


806 


N-linked (GlcNAc. . .) (Pot 


FT 


VAR_SEQ 


448 


451 


Missing (in isoform 2). 


FT 








/FTId=VSP_0256 72 . 


SQ 


SEQUENCE 


960 AA; 


110916 


MW; BFD0112FD310CE88 CRC64; 



Query Match 3 5.2 s , 

Best Local Similarity 40.0 s , 
Matches 330; Conservative 



; Score 1452.5; DB 1; 
; Pred. No. 4.6e-112; 
156; Mismatches 257; 



Length 960; 
Indels 81; Gaps 



6 DGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLLE 6 2 

I : I : I : : I I I I I I I I : I I : I : : : : : : I I I I : I I : 

145 DEDTKIHGVGFVKIHAPWHVLCREAEFLKLKMPTKKVYHISE — TRGLLK— TINSVLQK 2 00 

63 VVPDVPPEYYSCRFRVNKLPRFLGS DNQDTFFTSTKRHQILFEILAKTPYG 113 

: : I : I : I : I : : I : I I I I I : : I I I : I 

201 ITDPIQPKVAEHRPQTTKRLSYPFSREKQHLFDLTDRDSFFDSKTRSTI VYEILKRTTCT 260 



114 HEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWNKY 173 
I : : I I I I I I I I I I : I I I I I : II I I : : I : : I I : I : I I 
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Db 261 KAKYS -MG I T S LL ANGVYS AAYPLHDGDY EGDNV-EFNDRKLLYEEWASYGVFYKY 314 



174 QPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGSKD 2 33 

I I : I 11:111111 I I I I I I I II I : I I : : I I : I I I II I : I I : I : I : 
315 QPIDLVRKYFGEKVGLYFAWLGAYTQMLIPASIVGVIVFLYGCATVDENIPSMEMCDQRY 3 74 

23 4 SFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSATL 2 92 

: I I I I I I : I : I I I I I I : I III: I I I I I : I I I I I I : I : I I I I I 
3 75 NITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQMRL 43 4 

293 AYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPITGEDEPYFPERSRARRMLA 3 43 

I I I I : : I : I I I I : : I I : I I I : II 

43 5 NYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLRKESRNKET — DKVKLTWRDRFPAYFT 4 92 

3 4 4 GSVVI VVMVAVVVMCLVS I IL YRAIMAI WSRSGNTLLAAWASRI ASLTGSVVNLVF ILI 4 03 

I I : I : I I : : : I : I I I : : : : : : : I : : I I I I : : 

49 3 NLVSI IFMIAVTFAIVLGVI I YRISTAAALAMNSSPSVRSNIRVTVTATAVI INLVVI IL 552 

40 4 LSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYSSPVYI AFFKGRFVGYPGN 463 

I : : I : I I I : I : : I : I I : I I I : : I I I I : I : I I I I I I I I I I I : 
553 LDEVYGCI ARWLTKIEVPKTEKSFEERLTFKAFLLKFVNSYTPIFYVAFFKGRFVGRPGD 612 

46 4 YHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQEVLIPKLKGWWQKFRLRSK 521 

I : I 111111111:11 : I : I I : I I I : I I I : I : I I I : I : : : I I : 
613 YVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLFEIGIPKMKKFIRYLKLRRQ 6 72 

52 2 KRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVTIFVAACPLAPLFALLNNW 581 

: : I I : I I II I I : I I : : I I I I I I : I I I : I I I I I I I I I I I 
6 73 SPSDREEYVKRKQRYEVDFNLEPFAGLTPEYMEMIIQFGFVTLFVASFPLAPLFALLNNI 732 

582 VEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVISNAFLLAFSSDFLPRAYYR 6 41 

: I I I I I I : I I I I I I I I I I I : I I I I I :: I I I : I I I I I I I : : : I : I I I : I I I 
73 3 IEIRLDAKKFVTELRRPVAIRAKDIGIWYNILRGVGKLAVI INAFVI SFTSDFIPRLVYL 7 92 

642 WTRAHD— LRGFLNFTLARAPSSF AAAHN RTCRYRAFRD DD 680 

: : : : I I : I I I III II : I I I : : I : : 

79 3 YMYSQNGTMHGFVNHTL SSFNVSDFQNGTAPNDPLDLGYEVQICRYKDYREPPWSE 8 48 

681 GHY— SQTYWNLLAIRLAFVIVFEHWFSVGRLLDLLVPDIPESVEIKVKREYYL 733 

I I : : I : I I I I I I I I I I : : : I : : I : : I I I I : : : : : I I 
8 4 9 HKYDI SKDFWAVLAARLAFVI VFQNLVMFMSDFVDWVIPDIPKDI SQQIHKEKVLMVELF 9 0 8 

734 AKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKA 772 

III: : : I : I : : : I : I I I : I 

90 9 MREEQGKQQLLDTWM EKEKPRDVPCNNH-SPTTHPEA 944 



RESULT 13 
TM16A_HUMAN 

ID T M 1 6 A_H UM AN Reviewed; 986 AA. 

AC Q5XXA6; Q8IYY8; Q8N7V3 ; 

DT 29-MAY-2007, integrated into UniProtKB/Swiss-Prot . 
DT 23-NOV-2004, sequence version 1. 
DT 21-AUG-2007, entry version 18. 

DE Transmembrane protein 16A (Discovered on gastrointestinal stromal 
DE tumors protein 1) (Oral cancer overexpressed protein 2) (Tumor 
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DE amplified and overexpressed sequence 2). 

GN Name=TMEMl 6 A ; Synonyms=DOGl , 0RA0V2, TAOS2; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND TISSUE SPECIFICITY. 

RX PubMed=16906560; DOI=10 . 1002/gcc . 20371; 

RA Huang X., Godfrey T.E., Gooding W.E., McCarty K.S. Jr., Gollin S.M.; 

RT "Comprehensive genome and transcriptome analysis of the llql3 amplicon 

RT in human oral cancer and synteny to the 7F5 amplicon in murine oral 

RT carcinoma."; 

RL Genes Chromosomes Cancer 45:1058-1069(2006). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3). 

RC TISSUE=Testis; 

RX PubMed=14702039; DOI=10 . 1038/ngl285 ; 

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R. , 

RA Wakamatsu A., Hayashi K., Sato H., Nagai K. , Kimura K., Makita H., 

RA Sekine M. , Obayashi M. , Nishi T., Shibahara T., Tanaka T., Ishii S., 

RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., 

RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., 

RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., 

RA Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y . , 

RA Abe K., Kamihara K., Katsuta N . , Sato K., Tanikawa M. , Yamazaki M., 

RA Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., 

RA Tanai H., Kimata M., Watanabe M. , Hiraoka S., Chiba Y . , Ishida S., 

RA Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., 

RA Kanehori K., Takahashi-Fu j ii A., Hara H., Tanase T . -0 . , Nomura Y. , 

RA Togiya S., Komai F., Hara R., Takeuchi K., Arita M. , Imose N., 

RA Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., 

RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., 

RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki 0., 

RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., 

RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M. , Kawakami B., 

RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., 

RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., 

RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y. , 

RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., 

RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., 

RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., 

RA Togashi T., Oyama M. , Hata H., Watanabe M., Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., 

RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R. , 

RA Nakai K., Yada T., Nakamura Y., Ohara 0., Isogai T., Sugano S.; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs . " ; 

RL Nat. Genet. 36:40-45(2004). 

RN [3] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). 

RC TISSUE=Testis; 

RX PubMed=15489334; DOI=10 . 1101/gr . 2596504; 

RG The MGC Project Team; 
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RT "The status, quality, and expansion of the NIH full-length cDNA 

RT project: the Mammalian Gene Collection ( MGC ) . " ; 

RL Genome Res. 14:2121-2127(2004). 

RN [4] 

RP IDENTIFICATION, AND TOPOLOGY. 

RX PubMed=12739008; 

RA Katoh M. , Katoh M. ; 

RT "FLJ10261 gene, located within the CCND1-EMS1 locus on human 

RT chromosome llql3, encodes the eight-transmembrane protein homologous 

RT to C12orf3, Cllorf25 and FLJ34272 gene products."; 

RL Int. J. Oncol. 22:13 75-13 81(20 03). 

RN [5] 

RP SUBCELLULAR LOCATION, AND TISSUE SPECIFICITY. 

RX PubMed=15215166; 

RA West R.B., Corless C.L., Chen X., Rubin B.P., Subramanian S., 

RA Montgomery K., Zhu S., Ball C.A., Nielsen T.O., Patel R. , 

RA Goldblum J.R., Brown P.O., Heinrich M.C., van de Rijn M. ; 

RT "The novel marker, DOG1, is expressed ubiquitously in gastrointestinal 

RT stromal tumors irrespective of KIT or PDGFRA mutation status."; 

RL Am. J. Pathol. 165:107-113(2004). 

RN [6] 

RP SUBCELLULAR LOCATION. 

RX PubMed=16261155; DOI=10 . 1038/sj . one . 1209203 ; 

RA Carles A., Millon R. , Cromer A., Ganguli G . , Lemaire F., Young J., 

RA Wasylyk C, Muller D., Schultz I., Rabouel Y. , Dembele D., Zhao C, 

RA Marchal P., Ducray C, Bracco L., Abecassis J., Poch 0., Wasylyk B . ; 

RT "Head and neck squamous cell carcinoma transcriptome analysis by 

RT comprehensive validated differential display."; 

RL Oncogene 25: 1821-1831 (2006) . 

CC -!- SUBCELLULAR LOCATION: Cell membrane; Multi-pass membrane protein. 

CC Cytoplasm. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=l; 

CC IsoId=Q5XXA6-l ; Sequence=Displayed; 

CC Name=2; 

CC IsoId=Q5XXA6-2; Sequence=VSP_025665, VSP_025668, VSP_025669; 

CC Note=No experimental confirmation available; 

CC Name=3; 

CC IsoId=Q5XXA6-3; Sequence=VSP_025666, VSP_025667, VSP_025668, 

CC VSP_025669, VSP_025670, VSP_025671; 

CC Note=No experimental confirmation available; 

CC -!- TISSUE SPECIFICITY: Broadly expressed with higher levels in liver 
CC and skeletal muscle. 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attr ibution-NoDerivs License 

CC 

DR EMBL ; AY728143; AAU82085.1; -; mRNA. 

DR EMBL; AK097619; BAC05123.1; -; mRNA. 

DR EMBL; BC033036; AAH33036.2; -; mRNA. 

DR UniGene; Hs. 503074; -. 

DR Ensembl; ENSG00000131620; Homo sapiens. 

DR HGNC; HGNC: 21625; TMEM16A. 

DR MIM; 610108; gene. 
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DR 


PharmGKB; 


PA1283946 


'■,18; 






DR 


ArrayExpress; Q5XXA6 ; - 






DR 


InterPro; 


IPR007632; DUF590 




DR 


PANTHER; 


PTHR12308; 


DUF590; 


2. 


DR 


Pfam; PF04547; DUF590; 


1. 




PE 


1 : Evidence at protein 


level; 


KW 


Alternative splicing; Cytoplasm; Glycoprotein; Membrane; Polymorphism; 


KW 


Transmembrane . 








FT 


CHAIN 


1 


986 




Transmembrane protein 16A. 


FT 










/FTId=PRO_000028 8435. 


FT 


TOPO_DOM 


1 


333 




Cytoplasmic (Potential). 


FT 


TRANSMEM 


334 


354 




Potential . 


FT 


TOPO_DOM 


355 


402 




Extracellular (Potential) . 


FT 


TRANSMEM 


403 


423 




Potential . 


FT 


TOPO_DOM 


424 


519 




Cytoplasmic (Potential). 


FT 


TRANSMEM 


520 


540 




Potential . 


FT 


TOPO_DOM 


541 


565 




Extracellular (Potential) . 


FT 


TRANSMEM 


566 


586 




Potential . 


FT 


TOPO_DOM 


587 


606 




Cytoplasmic (Potential) . 


FT 


TRANSMEM 


607 


627 




Potential . 


FT 


TOPO_DOM 


628 


731 




Extracellular (Potential) . 


FT 


TRANSMEM 


732 


752 




Potential . 


FT 


TOPO_DOM 


753 


791 




Cytoplasmic (Potential). 


FT 


TRANSMEM 


792 


812 




Potential . 


FT 


TOPO_DOM 


813 


882 




Extracellular (Potential). 


FT 


TRANSMEM 


883 


903 




Potential . 


FT 


TOPO_DOM 


904 


986 




Cytoplasmic (Potential). 


FT 


CARBOHYD 


832 


832 




N-linked (GlcNAc. . .) (Potential). 


FT 


VAR_SEQ 


1 


116 




Missing (in isoform 2). 


FT 










/FTId=VSP_025665. 


FT 


VAR_SEQ 


1 


28 




Missing (in isoform 3). 


FT 










/FTId=VSP_025666 . 


FT 


VAR_SEQ 


29 


36 




GYLPSEGT -> MLTRPSQV (in isoform 3) . 


FT 










/FTId=VSP_025667 . 


FT 


VAR_SEQ 


448 


451 




Missing (in isoform 2 and isoform 3). 


FT 










/FTId=VSP_025668 . 


FT 


VAR_SEQ 


476 


501 




Missing (in isoform 2 and isoform 3). 


FT 










/FTId=VSP_025669 . 


FT 


VAR_SEQ 


651 


700 




CAPGGCLMELCIQLSI IMLGKQLIQNNLFEIGIPKMKKLIR 


FT 










YLKLKQQSP -> VTEILFISGSPFCLAYDLSTPCTWEKQL 


FT 










QHICSAKSSRFLSFLLETFLFP (in isoform 3) . 


FT 










/FTId=VSP_025670 . 


FT 


VAR_SEQ 


701 


986 




Missing (in isoform 3). 


FT 










/FTId=VSP_025671 . 


FT 


VARIANT 


608 


608 




F -> S (in dbSNP:rs2186797) . 


FT 










/FTId=VAR_032417. 


FT 


VARIANT 


983 


983 




G -> R (in dbSNP:rs3740722) . 


FT 










/FTId=VAR_032418 . 


FT 


CONFLICT 


801 


801 




N -> D (in Ref. 3; AAH33036). 


FT 


CONFLICT 


948 


948 




W -> C (in Ref. 3; AAH33036). 


SQ 


SEQUENCE 


986 AA; 


114078 


MW; E30A02F91EF36FC2 CRC64; 



Query Match 35.2%; Score 1450.5; DB 1; Length 986; 

Best Local Similarity 39.1%; Pred. No. 7.1e-112; 

Matches 335; Conservative 149; Mismatches 255; Indels 117; Gaps 
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5 QDGNTTVH YALLSASWAVLCYYAEDLRLKLPLQELPNQASNWSAGLLAWLGIPNVLL 61 

: I : I : I : : I I I I I I I I : I I : I : : : : I : I I I I : I I 

14 4 RDEDTKIHGVGFVKIHAPWNVLCREAEFLKLKMPTKKMYH— INETRGLLK — KINSVLQ 19 9 

62 EVVPDVPPEYYSCR FRVNKLPRFLGSDNQDTFFTSTKRHQILFEILAKTP 111 

: : : I : I I I I I I : I : I I I I I : : I I I : I 

20 0 KITDPIQPKVAEHRPQTMKRLSYPFSREKQHLFDLSD-KDSFFDSKTRSTIVYEILKRTT 2 58 

112 YGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWGKWN 171 

I : : I I I I I I I : I I : I I I I I : : I I : : I : : I I I : I : 

25 9 CTKAKYS-MGITSLLANGVYAAAYPLHDGDY NGENVEFNDRKLLYEEWARYGVFY 312 

172 KYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAWGTLVFLVGCFLVFSDIPTQELCGS 231 

I I I I : I I I : I I I I I : I I I I I I I II I : I I : : I I : I I I II : : I I : I : I 
313 KYQPIDLVRKYFGEKIGLYFAWLGVYTQMLIPASIVGI IVFLYGCATMDENIPSMEMCDQ 3 72 

23 2 KDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKRKSA 2 90 

: : I I I I I I : I : I I I I I I : I Ml: I I I I I : I I I I I I : I : I I I I 
3 73 RHNITMCPLCDKTCSYWKMSSACATARASHLFDNPATVFFSVFMALWAATFMEHWKRKQM 432 

291 TLAYRWDCSDYEDTEE RPRPQFAA SAPMTAPNPITGEDEPYFPERS 336 

I I I I I : : I : I I I I : : I I : I I : I I I 
43 3 RLNYRWDLTGFEEEEEAVKDHPRAEYEARVLEKSLKKESRNK EKRRHIPEESTNKWK 489 

33 7 -RARRMLAG SWIWMVAWVMCLVSIILYRAIMAIWS 3 74 

I : : I I I I : I : I I : : : I : I I I I : : 

49 0 QRVKTAMAGVKLTDKVKLTWRDRFPAYLTNLVSIIFMIAVTFAIVLGVI IYRISMAAALA 549 

3 75 RSGNTLLAAWASRIASLTGSVVNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLK 434 

: : : : : I : : I I I I : : I : : I : I I I : I : : I : II: I 

55 0 MNSSPSVRSNIRVTVTATAVI INLVVI ILLDEVYGCIARWLTKIEVPKTEKSFEERLIFK 6 09 

43 5 VFIFQFVNFYSSPVYI AFFKGRFVGYPGNYHTLF— GVRNEECAAGGCLIELAQELLVIMV 4 93 

I : : I I I I : I : I I I I I I I I I I I : I : I I I I I I I I I I : I I : I : I I : 
610 AFLLKFVNSYTPIFYVAFFKGRFVGRPGDYVYIFRSFRMEECAPGGCLMELCIQLSI IML 669 

49 4 GKQVI-NNMQEVLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEY 552 

I I I : I I I : I : I I I : I : : I : : : : I I I I I II II 

6 70 GKQLIQNNLFEIGIPKMKKLIRYLKLKQQSPPDHEECVKRKQRYEVDYNLEPFAGLTPEY 72 9 

553 LEMVLQFGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHI 612 

: I I : : I I I I I I : I I I : I I I I I I I I I I I : I I I I I I : I I I I I I I I I I I : I I I I I : : I 
73 0 MEMI IQFGFVTLFVASFPLAPLFALLNNI IEIRLDAKKFVTELRRPVAVRAKDIGIWYNI 789 

613 LAGLTHLAVISNAFLLAFSSDFLPRA — YYRWTRAHDLRGFLNFTLARAPSSF 663 

II: I I I I I I I : : : I : I I I : I I I : : : : I I : I I I III 
79 0 LRGIGKLAVI INAFVI SFTSDFIPRLVYLYMYSKNGTMHGFVNHTL SSFNVSDFQN 8 45 

664 AAAHN RTCRYRAFRD DDGHY — SQTYWNLLAIRLAFVIVFEHVVFSVG 709 

II : I I I : : I : : I I : : I : I I I I I I I I I I : : : I : 

8 46 GTAPNDPLDLGYE VQI CRYKDYREPPWSENKYDI SKDFWAVL AARLAFVI VFQNL VMFMS 9 05 

710 RLLDLLVPDIPESVEIKVKREYYLA KQALAENEVLFGTNGTKDEQP 755 

: I : : I I I I : : : : : I I III I I I I I 

906 DFVDWVIPDIPKDISQQIHKEKVLMVELFMREEQDKQQLL — ETWMEKERQKDEPPCNHH 9 63 
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Qy 756 KGSELSSH 763 

II II 

Db 964 NTKACPDSLGSPAPSH 979 



TM1 6 B_HUMAN 

ID TM1 6 B_HUMAN Reviewed; 999 AA. 

AC Q9NQ90; 

DT 16-JAN-2004, integrated into UniProtKB/Swiss-Prot . 

DT 01-OCT-2000, sequence version 1. 

DT 24-JUL-2007, entry version 35. 

DE Transmembrane protein 16B. 

GN Name=TMEM16B; Synonyms=Cl 2orf 3 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] . 

RC TISSUE=Retina; 

RA Lorenz B., White K.E., Econs M.J., Strom T.M.; 

RT "Transcripts in 12pl3.3."; 

RL Submitted (FEB-2000) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SUBCELLULAR LOCATION: Membrane; Multi-pass membrane protein 

CC (Probable) . 

CC -!- SIMILARITY: Belongs to the TMEM16 family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attr ibution-NoDer ivs License 

CC 

DR EMBL; AJ272204; CAC01125.1; -; mRNA. 

DR UniGene; Hs. 148970; -. 

DR Ensembl; ENSG00000047617; Homo sapiens. 

DR KEGG; hsa:57101; -. 

DR HGNC; HGNC:1183; TMEM16B. 

DR MIM; 610109; gene. 

DR PharmGKB; PA2550 4; -. 

DR LinkHub; Q9NQ90; -. 

DR ArrayExpress; Q9NQ90; -. 

DR GermOnline; ENSG00000047617 ; Homo sapiens. 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Membrane; Polymorphism; Transmembrane. 

FT CHAIN 1 999 Transmembrane protein 16B. 

FT /FTId=PRO_0000072564. 



FT 


TRANSMEM 


360 


382 


Potential 


FT 


TRANSMEM 


535 


557 


Potential 


FT 


TRANSMEM 


577 


599 


Potential 


FT 


TRANSMEM 


619 


641 


Potential 


FT 


TRANSMEM 


746 


768 


Potential 



http://es/ScoreAccessWeb/GetItem.aciion?.^ 



SCORE Search Results Details for Application 105525 15 and Search Result 200S0624_135830_us-10-552-515-l_copy_157_933.rap. 



T TRANSMEM 796 818 Potential. 

T TRANSMEM 898 920 Potential. 

T VARIANT 108 108 V -> A (in dbSNP : rs3741903 ) . 

T /FTId=VAR_021932 . 

T VARIANT 501 501 S -> A (in dbSNP : rsl860961 ) . 

T /FTId=VAR_020331 . 

Q SEQUENCE 999 AA; 113616 MW; B9B4F56161AE1B00 CRC64; 

Query Match 34.9%; Score 1437; DB 1; Length 999; 

Best Local Similarity 38.7%; Pred. No. 9.8e-lll; 

Matches 326; Conservative 161; Mismatches 263; Indels 92; Gaps 23; 

y 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQELPNQ ASNWSAGLLAW 53 

: : I : : : : : : I I I I I I I : : I : I : : : I : I I I 
b 170 EKDLENKSQGS IFVRIHAPWQVLAREAEFLKIKVPTKKEMYEIKAGGSI AKKFSAAL 226 

y 54 LGIPNVLLEVVPDVPPEYYSCRFRVNKLP RFLGSDNQDTFFTSTKRHQILFEIL 107 

: : I I ||:::: | : I I I I : I : I : I I I 

b 22 7 QKLSSHLQPRV-PEHSNNKMKNLSYPFSREKMYLYNIQEKDTFFDNATRSRI VHEIL 2 82 

y 10 8 AKTPYGHEKKNLLGIHQLLAE GVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARW 16 7 

: I | : | | : | : | : I I : I I I I I : : I : : I I : : I : I I I I : 

b 283 KRTACS-RANNTMGINSLIANNI YEAAYPLHDGEYDSPEDD MNDRKLLYQEWARY 336 

y 168 GKWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQE 227 

I : I : I I : I : I : I I I I I : I I I I I I I II : I : I : : I : I : I I I II : I I I : : I 
b 33 7 GVFYKFQPIDLIRKYFGEKIGLYFAWLGLYTSFLIPSSVIGVIVFLYGCATIEEDIPSRE 3 96 

y 22 8 LCGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWK 2 86 

: I : : : I I I I I I I : I I I I I I III III: I I I I I : I I I I I I : I I I I 
b 39 7 MCDQQNAFTMCPLCDKSCDYWNLSSACGTAQASHLFDNPATVFFSIFMALWATMFLENWK 456 

y 28 7 RKSATLAYRWDCSDYEDTEER PRPQFAA SAPMTAPNPIT G 326 

I I I I I : I : I I I II:: II I 

b 457 RLQMRLGYFWDLTGIEEEEERAQEHSRPEYETKVREKMLKESNQSAVQKLETNTTECGDE 516 

y 32 7 EDEPYFPERSRARRMLAGSVVIVVMVAWVMCLVSI ILYRAIMAIVVSRSGNTLLAAWAS 3 86 

: I I : I I I : I : I : : : I : I I I : I III 
b 517 DDEDKLTWKDRFPGYLMNFAS ILFMI ALTFSIVFGVIVYRITTAAALS LNKATRS 571 

y 38 7 RI ASLTGSVVNLVFILILSKI YVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNF 443 

: : I : : I I I I I I I : I I : : I I I : I : : I : I I : I I I :: I I I 

b 572 NVRVTVTATAVI INLVVILILDEI YGAVAKWLTKIEVPKTEQTFEERLILKAFLLKFVNA 631 

y 444 YSSPVYIAFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNM 501 

II 1:111111111 I I : I : I I I I I I I I I I I : I I : I : I I : I I I : I II: 

b 632 YSPIFYVAFFKGRFVGRPGSYVYVFDGYRMEECAPGGCLMELCIQLS I IMLGKQLIQNNI 691 

y 502 QEVLIPKLKGWWQKFRLRSKKRKAGASAGA-SQGP — WEDDYELVPCEGLFDEYLEMVLQ 558 

I : : I I I I : I I : : I I : I I : I I : I I I I II I I : I I : : I 

b 692 FEIGVPKLK KLFRKLKDETEAGETDSAHSKHPEQWDLDYSLEPYTGLTPEYMEMI IQ 748 

y 559 FGFVTIFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTH 618 

11111:111: 1111:111111 : I : I I I I : I I I I III I I : I I I I I I I I : I : 
b 749 FGFVTLFVASFPLAPVFALLNNVIEVRLDAKKFVTELRRPDAVRTKDIGIWFDILSGIGK 808 
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619 LAVISNAFLLAFSSDFLPRAYYRWTRAHD — LRGFLNFTLA RAPSSFAA 665 

: I I I I I I : : I : I I I : I I I : : : : I : I I I : I I I : : : I 

80 9 FSVISNAFVIAITSDFIPRLVYQYSYSHNGTLHGFVNHTLSFFNVSQLKEGTQPENSQFD 868 

66 6 AHNRTCRYRAFRD DDGHYSQTYWNLLAIRLAFVI VFEHVVFSVGRLLDLLVPDIP 72 0 

: I I : : : I : : : I : I I : I : I I I I I I : I : : : I : I : I : : I I I I 

86 9 QEVQFCRFKDYREPPWAPNPYEFSKQYWFILSARLAFVI IFQNLVMFLS VLVDWMIPDIP 9 2 8 

721 ESVEIKVKRE YYLAKQALAENEVLFGTNGTKDEQPKGSELSSHWTPFTVPKA-S 7 73 

: : : I : I : : I : I : I I : I I : I : I I 

92 9 TDI SDQIKKEKSLLVDFFLKE EHEKLKLMDEPALRSPGGGDRSRSRAASSAPSGQS 9 84 

774 QL 775 
I I 

985 QL 986 



RESULT 15 
Q8CFW1_M0USE 

ID Q8CFW1_M0USE Unreviewed; 913 AA. 

AC Q8CFW1; 

DT 01-MAR-2003, integrated into UniProtKB/TrEMBL . 

DT 01-MAR-2003, sequence version 1. 

DT 24-JUL-2007, entry version 21. 

DE Transmembrane protein 16B. 

GN Name=Tmeml6b; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Glires; Rodentia; Sciurognathi ; 

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Eye; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=l 0 . 1 0 73 /pnas . 2 426 03 8 9 9 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F . , 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L . , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16 899-16903(2002). 
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RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Eye; 

RG NIH MGC Project; 

RL Submitted (JUN-2002) to the EMBL / GenBank / DDB J databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC033409; AAH33409.1; -; mRNA. 

DR UniGene; Mm. 333398; -. 

DR Ensembl; ENSMUSG00000038115; Mus musculus. 

DR MGI; MGI : 2387214; Tmeml6b. 

DR GO; GO: 0016021; C : integral to membrane; IEA : UniProtKB-KW . 

DR InterPro; IPR007632; DUF590. 

DR PANTHER; PTHR12308; DUF590; 1. 

DR Pfam; PF04547; DUF590; 1. 

PE 2: Evidence at transcript level; 

KW Transmembrane. 

SQ SEQUENCE 913 AA; 104388 MW; CA1 7DB2 7D81 6 7F6 4 CRC64; 

Query Match 34.8%; Score 1434.5; DB 2; Length 913; 

Best Local Similarity 39.5%; Pred. No. 1.4e-110; 

Matches 310; Conservative 160; Mismatches 242; Indels 73; Gaps 18; 

Qy 1 QQDVQDGNTTVHYALLSASWAVLCYYAEDLRLKLPLQEL PNQASNWSAGLLAWL 5 4 

: : I : : : : : I I I I I I I : : I : I : : : : I : I I : I I 

Db 85 EKDLESKSQGSVFVRIHAPWQVLAREAEFLKIKVPTKKMYEIKAGGSIAKKFSA-ILQTL 143 

Qy 55 GIPNVLLEVVPDVPPEYYSCRFRVNKLP RFLGSDNQDTFFTSTKRHQILFEILA 108 

I : | | | | : : | : | : I I I I : I : I : I I I 

Db 144 SSP LQPRV-PEHSNNRMKNLSYPFSREKMYLYNIQEKDTFFDNATRSRIVHEILK 197 

Qy 10 9 KTPYGHEKKNLLGIHQLLAEGVLSAAFPLHDGPFKTPPEGPQAPRLNQRQVLFQHWARWG 168 

: I | : | | : | : | : I I : I I I I I : : I : : I I : : I : I I I I : I 

Db 198 RTACS-RANNTMGINSLI ANNIYEAAYPLHDGEYDSPGDD MNDRKLLYQEWARYG 251 

Qy 169 KWNKYQPLDHVRRYFGEKVALYFAWLGFYTGWLLPAAVVGTLVFLVGCFLVFSDIPTQEL 228 

: I : I I : I : I : I I I I I : I I I I I I I II : I : I : : I : I : I I I II : I I I : : I : 
Db 252 VFYKFQPIDLIRKYFGEKIGLYFAWLGLYTSFLIPSSVIGVI VFLYGCATIEEDIPSKEM 311 

Qy 22 9 CGSKDSFEMCPLC-LDCPFWLLSSACALAQAGRLFDHGGTVFFSLFMALWAVLLLEYWKR 287 

I : : : I I I I I I I : I I I I I I I : I III: I I I I I : I I I I I I : I I I I I 
Db 312 CDHQNAFTMCPLCDKSCDYWNLSSACGTARASHLFDNPATVFFS IFMALWATMFLENWKR 3 71 

Qy 288 KSATLAYRWDCSDYEDTEER PRPQF AASAPMTAPNPITGE-D 328 

I I I I : I : I I I II:: : I : I : I II 

Db 372 LQMRLGYFWDLTGIEEEEERSQEHSRPEYETKVREKLLKESGKSAVQKLEANSPEDDEDD 431 

Qy 329 EPYFPERSRARRMLAGSVVIVVMVAVWMCLVSI ILYRAIMAIVVSRSGNTLLAAWASRI 388 

I : I I I : I : I : : : I : I I I : I III: 
Db 432 EDKLTWKDRFPGYLMNFAS I LFMI ALTFS I VFGVI VYRI TTAAALS LNKATRSNV 486 

Qy 389 ASLTGSWNLVFILILSKIYVSLAHVLTRWEMHRTQTKFEDAFTLKVFIFQFVNFYS 445 

: I : : I I I I I I I : I I : : I I I : I : : I : I I : I I I :: I I I I I 
Db 48 7 RVTVTATAVI INLVVILILDEI YGAVAKWLTKIEVPKTEQTFEERLILKAFLLKFVNAYS 546 
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Qy 446 SPVYI AFFKGRFVGYPGNYHTLF-GVRNEECAAGGCLIELAQELLVIMVGKQVI-NNMQE 503 

1:111111111 I I : I : I I I I I I I I I I I : I I : I : I I : I I I : I I I : I 
Db 547 PIFYVAFFKGRFVGRPGSYVYVFDGYRMEECAPGGCLMELCIQLSI IMLGKQLIQNNIFE 606 

Qy 50 4 VLIPKLKGWWQKFRLRSKKRKAGASAGASQGPWEDDYELVPCEGLFDEYLEMVLQFGFVT 563 

: : I I I I : : I : : : : : I : I : I I II I I : I I : : I I I I I I 

Db 60 7 IGVPKLKKLFRKLKDETEPGESDPDHSKRPEQWDLDHSLEPYTGLTPEYMEMI IQFGFVT 666 

Qy 564 IFVAACPLAPLFALLNNWVEIRLDARKFVCEYRRPVAERAQDIGIWFHILAGLTHLAVIS 623 

: I I I : 1111:111111 : I : I I I I : I I I I III I I : I I I I I I I I : I : : I I 
Db 66 7 LFVASFPLAPVFALLNNVIEVRLDAKKFVTELRRPDAVRTKDIGIWFDILSGIGKFSVII 726 

Qy 62 4 NAF L L AF S S DF LP RAY YRWTRAHD — LRGFLNFTLA RAP S S F AAAHNRT 6 70 

I I I : : I : I I I : I I I : : : : I : I I I : I I I : : : I : 

Db 72 7 NAFVI AVTSDFIPRLVYQYSYSHNGTLHGFVNHTLSFFNVSQLKEGTQPENSQFDQEVQF 786 

Qy 671 CRYRAFRD DDGHYSQTYWNLLAIRLAFVI VFEHVVFSVGRLLDLLVPDIPESVEI 725 

I I : : : I : : : I : I I : : I : I I I I I I : I : : : I : I : I : : I I I I : 

Db 787 CRFKDYREPPWAPNPYEFSKQYWSVLSARLAFVI IFQNLVMFLSVLVDWMIPDIPTDISD 846 

Qy 72 6 KVKRE 73 0 

: : I : I 

Db 847 QIKKE 851 

Search completed: June 24, 2008, 15:31:04 
Job time : 538 sees 
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